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AGATCTGTGG CTTCCGGTTG GCTACTTGTA ACCAACTGAT GGTCAGATGG ATCTGCCGTC 60 
TGTTTTGATT TGAATTTTCC CTGCTCATTC TGATTCTGTG AGAGGCTGCA TTCATTATCA 120 
CATCTCATAC CCGGCGCCTG CGACTTCGGT CACCTCTGCG GTCTGGCGGT TACCGGGGTC 180 
CGTCTGAGAC TCGTCAGTCA GCCATTCGAG TATGCGAACT CTGACTTTGC TCACCTAAGA 240 
GTTTGCACGA GATGCCGAAA TCCTCCTCGA GTAGAGTTTG CAAGGCTTGA ACCTTGGTCC 300 
TTGAAGCCCG AAAGTGGCTC AGTAGTGGGA TCGATAGTCT GGTTGTTGAA GATTTTCTCC 360 
TCCACCTTAC CTATGGCCGC TGGCCTTCTC CACCTTTCAG GCTTTCAGGC ACCCTCGGCT 420 
CGGATTCTGT ATCGTCCGGT ACCGAAGCTA GTCCTAGCTA GTCAAAGCTA GTCCAAGCTA 4 80 
GTCTCGTCAA GGTTTGGCGC AGCGCGGTTC CGTGTAAAGT ACAAATTTGA AATACGAATA 54 0 
CGCAGTACTC GCAGCCGGCA CTTCCGCTCA GCCCAGGCTC AGAGGCTAAG GGTGTTGGCG 600 
CTTCCTCATC ATCTTCTTCT CGTCGACCTT TTCCTCTTTC TCTCCCTATC GGTGCTTCTC 660 
TCCAACCTCA TTCTCAGTCG TTCGCCCATC AGGTTTATAC TCCGGCTCCG TGGCCATCTG 720 
CCTCCCTCAC GACCTCCTCG TTCCAGGTTT TCCTCTCGAC TGCTGCGCCC TTGCACTTCG 7 80 
CCTTGCATCA GTGAAACCCC CTGCAACGTG ACGGCTCAAA GACATCCTCG TTTGGCCGCT 840 
GGAGACCGGA GCGTGCGCTT CGTTTCGTCT TCTTCGAACC GATCTCAATT TCCCCGCTCG 900 
GGTTGACGCC GTCAGCACCC TGCTCGTTGC CTAACGGCTT GTTATTCAAG ACCCCTTTTC 960 
TGCCGCTTCC GCGACCGATT TATTCGTCGC CTTCCAACTC TTGTACAATC GGGGGGAAAG 1020 
AAAGCAGACG GAGTTCGATC TGGAGGAATT ATAGCTGAGT CTTGCCCGCA AGACTCGCCG 1080 
CAACCATGAA TCAAACACTT CCCACGTGGA AGGACCGCAC GCAGAACCAG TTTGGAAAGC 114 0 
TTCAGATCCA GGTTCCATGG CGGTCCATCC AACTGCTCGT CCCGCATCGC ATGCGGCGGA 1200 
AGTTAAGGTC CAAATTGCGC AGTAGAGCGT CTCCTACCTC GTCAATAGCG TCTTTACAGA 12 60 
CGTCGTTATC GCCTGCAGAC ACACTACGAT CGCTCCAAAG CCACCGATGG ACGGTTTACG 1320 
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ACTTCCAATA TCTGCTTCTG TTGATCGTGG GCATCTTCTC TTTGACCGTT ATCGAGTCGC 1380 
CCGGGCCTTT GGGCAAAACG GCCATTTTCT CCATGCTCCT ATTCTCTCTC CTGATCCCTA 14 4 0 
TGACCCGCCA GTTCTTCCTC CCGTTTCTGC CGATTGCCGG ATGGCTTCTG TTTTTCTACG 1500 
CCTGCCAGTG AGTTAAAAAC AACCCGCTAC CAGACCCCGT GCAGCAGTTA CTCACATATG 1560 
CAGGTTCATC CCAAGCGATT GGCGCCCTGC GATTTGGGTT CGTGTCTTGC CTGCACTGGA 1620 
GAATATTCTC TACGGCGCAA ACATCAGCAA CATCCTATCC GCTCACCAGA ACGTTGTGCT 1680 
TGACGTGCTG GCGTGGCTAC CCTACGGTAT CTGCCACTAT GGCGCTCCGT TTGTGTGCTC 174 0 
GTTGATCATG TTCATCTTCG GTCCGCCCGG CACTGTTCCC CTTTTCGCGC GCACTTTCGG 1800 
CTATATCAGT ATGACTGCGG TTACTATTCA GCTGTTTTTC CCTTGCTCTC CACCTTGGTA 1860 
TGAGAATCGC TATGGTCTAG CTCCGGCAGA CTACTCCATC CAAGGTGATC CCGCAGGGCT 1920 
TGCCCGCATT GACAAGCTTT TCGGCATCGA CCTTTACACG TCTGTTTTCC ATCAGTCGCC 1980 
TGTTGTGTTC GGCGCTTTTC CGTCGCTGCA TGCTGCCGAC TCAACCCTGG CCGCACTTTT 2040 
CATGAGTCAT GTTTTCCCCC GCATGAAGCC CGTCTTCGTG ACCTATACTC TATGGATGTG 2100 
GTGGGCAACA ATGTACCTCT CACATCACTA TGCGGTCGAT TTGGTTGCGG GTGGTCTCCT 2160 
GGCCGCCATT GCTTTCTACT TCGCCAAGAC CCGATTCCTT CCCCGTGTCC AGCTCGACAA 2220 
GACCTTCCGT TGGGACTACG ACTATGTGGA ATTCGGCGAG TCTGCCCTGG AGTATGGGTA 2280 
TGGTGCAGCT GGCTATGATG GAGACTTCAA TCTCGACAGC GATGAATGGA CTGTTGGTTC 234 0 
TTCATCCTCC GTCTCCTCAG GCTCCTTGAG TCCCGTTGAC GATCATTACT CATGGGAAAC 24 00 
CGAGGCACTG ACCTCCCCAC ATACTGATAT TGAGTCCGGC AGGCATACTT TCAGCCCTTG 24 60 
AGTAGCCACA AACCAAACTC GATACCTGCA TATAGCGATC TCGCTCCTCC TCCACTGCAT 2520 
CTATTTACGA GACGGCGTTA GAACATTTCA CGACATTCTG GCTTTATTGC ATCGAGCACA 2580 
TTTCGACACA TATATCTTTA ATACCCTTTC TTCGGTGTCC CAGATCATCG GTTCGACCTT 2 640 
AATGTACCTC GGTCCGAATC CGCCTGGGAT ACTGTTTCTC TTTCCGCCGC ACTTCACTGT 2700 
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ACATTGCTTG ACATTGCGAA ACCGGGTTGG GCTCGAACGT GGGATGGGTT ATCGCTCATC 27 60 
GCTACACGCC GTTGCTCCAT CATAATGTTA ATGGACACAA TGGGGCTACG CATCCTGGTG 2820 
TTTAGTCCTG GAAGACCATC CGATAACCCC CGTCGGTAAC ACTCGCTTGT CTCGTGTCCA 2880 
CCCAGACACT ACTTCAATTC TCACTTCTAT CGTCCGCTAT TACCTTGACC TGGTCGAACC 294 0 
CATCCTTATT ATTCGTTTCG ACTATGCTAT ATATTTATTT TTACCATTCG TGTCGATCGC 3000 
TCATACTCTT GGCGCTTGGG ACTGGAAGCA TTTATATTGG AAAAAATCAC GGAATGGGGC 3060 
GCCTTTTCTT CTTGCACTTC ACTCGCTGTG CATAGACGGT TTTACATTTC TGCTTTGCAA 3120 
TGCATCACGA ACTCTGCATT AGCATATAGA AAGAGGGGAA GGATGGACCT TCTTCTTGAT 3180 
TGCTCGCATG GTTTATCCAT TCGCTCAAAG TGGATTACGT CCACATATTA CCCGGGGGCT 3240 
ATACACATGG CTACTGTGTT GCTTTCTGAC ATTCGCCGGA CGTGCAAGGT TGGGAGGAGA 3300 
GTCTGACGCT GACGGGGCTT GTTGAAGGAT GTTCACGCGT CCCGATTTGA CCCGGCTTCG 33 60 
ACTAACCTCA GATTCTCGAC TTGTTGGACG GTGACTTGAC TTGCTTGCTA TGGTCTGACG 3420 
CTCTCACACC TACCTATCAC ATCCTCCTCA CCTCACAAAT TCCGCTCATG GACACTATCC 3480 
TCTTCTTTTC GTTTCCCTTG GATAGTGTGT GTGTGTGTGT GGTTGGGGCA AATTATCCAT 354 0 
AGCAGCAGTA TTATTAGTTA TAATCCGGTA GTGTTATGAT TTATGAAGGC AACTTGTATA 3600 
CTATTGCCAC TTTGTCCATA TCTCTTGCTT GTAATAGAAC TGACATCGCG ACGCTTCGGC 3660 
CACGATGCAT ATAAAAACTC TAGTCAACAC GATATTAACA AGCGAAACCA TTACGCTGTA 3720 
AACTATTCAG GATCGCCGCG GGCCCATCTG GGACTTGACT GTACTAAATA TGTCCTAAAG 3780 
CAAGCAGACT AAATATTTAA CGTGGGATAT TATTCATATA CGCATATGTA TACATAGTCA .384 0 
TAACAAGCCA AGGGGTGGGT AGGGGTGGGT AATTATTATT TTTTTTCGTC GATACAAGTA 3900 
TCCATCCTTA AATGTCCGTG GTCTACTCTT CATAAATCTT AACCCGCTCC GCATACTCCT 3960 
TTATCCTCGA GACAAAAGTG TCTTCAATTT CATCGCCACG GCCACCAGCA ACGCGGAGGA 4 020 
TAAGGTCGTT GAGAGAGGCG CGGCCGAGAA TGTCGACATG GTCGCCTTCA TATTGTTAGC 4 080 
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ATGCGACGTC AGACTGGAAC CAGGAAGGGA AAGGAGAGAG GTACCTGTAT TTGGACCACC 414 0 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 439 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Asn Gin Thr Leu Pro Thr Trp Lys Asp Arg Thr Gin Asn Gin 
1 5 10 15 

Phe Gly Lys Leu Gin lie Gin Val Pro Trp Arg Ser lie Gin Leu 

20 25 30 

Leu Val Pro His Arg Met Arg Arg Lys Leu Arg Ser Lys Leu Arg 

35' 40 45 

Ser Arg Ala Ser Pro Thr Ser Ser lie Ala Ser Leu Gin Thr Ser 

50 55 60 

Leu Ser Pro Ala Asp Thr Leu Arg Ser Leu Gin Ser His Arg Trp 

65 70 75 

Thr Val Tyr Asp Phe Gin Tyr Leu Leu Leu Leu lie Val Gly He 

80 85 90 
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Phe Ser Leu Thr Val lie Glu Ser Pro Gly Pro Leu Gly Lys Thr 

95 100 105 

Ala lie Phe Ser Met Leu Leu Phe Ser Leu Leu lie Pro Met Thr 

110 115 120 

Arg Gin Phe Phe Leu Pro Phe Leu Pro lie Ala Gly Trp Leu Leu 

125 130 135 

Phe Phe Tyr Ala Cys Gin Phe lie Pro Ser Asp Trp Arg Pro Ala 

140 145 150 

lie Trp Val Arg Val Leu Pro Ala Leu Glu Asn lie Leu Tyr Gly 

155 160 165 

Ala Asn lie Ser Asn lie Leu Ser Ala His Gin Asn Val Val Leu 

170 175 180 

Asp Val Leu Ala Trp Leu Pro Tyr Gly lie Cys His Tyr Gly Ala 

185 190 195 

Pro Phe Val Cys Ser Leu lie Met Phe lie Phe Gly Pro Pro Gly 

200 205 210 

Thr Val Pro Leu Phe Ala Arg Thr Phe Gly Tyr lie Ser Met Thr 

215 220 225 

Ala Val Thr lie Gin Leu Phe Phe Pro Cys Ser Pro Pro Trp Tyr 

230 235 240 

Glu Asn Arg Tyr Gly Leu Ala Pro Ala Asp Tyr Ser lie Gin Gly 

245 250 255 

Asp Pro Ala Gly Leu Ala Arg lie Asp Lys Leu Phe Gly lie Asp 
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260 265 270 

Leu Tyr Thr Ser Val Phe His Gin Ser Pro Val Val Phe Gly Ala 

275 280 285 

Phe Pro Ser Leu His Ala Ala Asp Ser Thr Leu Ala Ala Leu Phe 

290 295 300 

Met Ser His Val Phe Pro Arg Met Lys Pro Val Phe Val Thr Tyr 

305 310 315 

Thr Leu Trp Met Trp Trp Ala Thr Met Tyr Leu Ser His His Tyr 

320 325 330 

Ala Val Asp Leu Val Ala Gly Gly Leu Leu Ala Ala lie Ala Phe 

335 340 345 

Tyr Phe Ala Lys Thr Arg Phe Leu Pro Arg Val Gin Leu Asp Lys 

350 355 360 

Thr Phe Arg Trp Asp Tyr Asp Tyr Val Glu Phe Gly Glu Ser Ala 

365 370 375 

Leu Glu Tyr Gly Tyr Gly Ala Ala Gly Tyr Asp Gly Asp Phe Asn 

380 385 390 

Leu Asp Ser Asp Glu Trp Thr Val Gly Ser Ser Ser Ser Val Ser 

395 400 405 

Ser Gly Ser Leu Ser Pro Val Asp Asp His Tyr Ser Trp Glu Thr 

410 415 420 

Glu Ala Leu Thr Ser Pro His Thr Asp lie Glu Ser Gly Arg His 

425 430 435 
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Thr Phe Ser Pro 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2856 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

GGTTTATACT CCGGCTCCGT GGCCATCTGC CTCCCTCACG ACCTCCTCGT TCCAGGTTTT 60 

CCTCTCGACT GCTGCGCCCT TGCACTTCGC CTTGCATCAG TGAAACCCCC TGCAACGTGA 120 

CGGCTCAAAG ACATCCTCGT TTGGCCGCTG GAGACCGGAG CGTGCGCTTC GTTTCGTCTT 180 

CTTCGAACCG ATCTCAATTT CCCCGCTCGG GTTGACGCCG TCAGCACCCT GCTCGTTGCC 24 0 

TAACGGCTTG TTATTCAAGA CCCCTTTTCT GCCGCTTCCG CGACCGATTT ATTCGTCGCC 300 

TTCCAACTCT TGTACAATCG GGGGGAAAGA AAGCAGACGG AGTTCGATCT GGAGGAATTA 360 

TAGCTGAGTC TTGCCCGCAA GACTCGCCGC AACCATGAAT CAAACACTTC CCACGTGGAA 420 

GGACCGCACG CAGAACCAGT TTGGAAAGCT TCAGATCCAG GTTCCATGGC GGTCCATCCA 480 

ACTGCTCGTC CCGCATCGCA TGCGGCGGAA GTTAAGGTCC AAATTGCGCA GTAGAGCGTC 54 0 

TCCTACCTCG TCAATAGCCT CTTTACAGAC GTCGTTATCG CCTGCAGACA CACTACGATC 600 

GCTCCAAAGC CACCGATGGA CGGTTTACGA CTTCCAATAT CTGCTTCTGT TGATCGTGGG 660 
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CATCTTCTCT TTGACCGTTA TCGAGTCGCC CGGGCCTTTG GGCAAAACGG CCATTTTCTC 720 
CATGCTCCTA TTCTCTCTCC TGATCCCTAT GACCCGCCAG TTCTTCCTCC CGTTTCTGCC 780 
GATTGCCGGA TGGCTTCTGT TTTTCTACGC CTGCCAGTTC ATCCCAAGCG ATTGGCGCCC 840 
TGCGATTTGG GTTCGTGTCT TGCCTGCACT GGAGAATATT CTCTACGGCG CAAACATCAG 900 
CAACATCCTA TCCGCTCACC AGAACGTTGT GCTTGACGTG CTGGCGTGGC TACCCTACGG 960 
TATCTGCCAC TATGGCGCTC CGTTTGTGTG CTCGTTGATC ATGTTCATCT TCGGTCCGCC 1020 
CGGCACTGTT CCCCTTTTCG CGCGCACTTT CGGCTATATC AGTATGACTG CGGTTACTAT 1080 
TCAGCTGTTT TTCCCTTGCT CTCCACCTTG GTATGAGAAT CGCTATGGTC TAGCTCCGGC 114 0 
AGACTACTCC ATCCAAGGTG ATCCCGCAGG GCTTGCCCGC ATTGACAAGC TTTTCGGCAT 1200 
CGACCTTTAC ACGTCTGTTT TCCATCAGTC GCCTGTTGTG TTCGGCGCTT TTCCGTCGCT 1260 
GCATGCTGCC GACTCAACCC TGGCCGCACT TTTCATGAGT CATGTTTTCC CCCGCATGAA 1320 
GCCCGTCTTC GTGACCTATA CTCTATGGAT GTGGTGGGCA ACAATGTACC TCTCACATCA 1380 
CTATGCGGTC GATTTGGTTG CGGGTGGTCT CCTGGCCGCC ATTGCTTTCT ACTTCGCCAA 14 4 0 
GACCCGATTC CTTCCCCGTG TCCAGCTCGA CAAGACCTTC CGTTGGGACT ACGACTATGT 1500 
GGAATTCGGC GAGTCTGCCC TGGAGTATGG GTATGGTGCA GCTGGCTATG ATGGAGACTT 1560 
CAATCTCGAC AGCGATGAAT GGACTGTTGG TTCTTCATCC TCCGTCTCCT CAGGCTCCTT 1620 
GAGTCCCGTT GACGATCATT ACTCATGGGA AACCGAGGCA CTGACCTCCC CACATACTGA 1680 
TATTGAGTCC GGCAGGCATA CTTTCAGCCC TTGAGTAGCC ACAAACCAAA CTCGATACCT 17 4 0 
GCATATAGCG ATCTCGCTCC TCCTCCACTG CATCTATTTA CGAGACGGCG TTAGAACATT 1800 
TCACGACATT CTGGCTTTAT TGCATCGAGC ACATTTCGAC ACATATATCT TTAATACCCT 18 60 
TTCTTCGGTG TCCCAGATCA TCGGTTCGAC CTTAATGTAC CTCGGTCCGA ATCCGCCTGG 1920 
GATACTGTTT CTCTTTCCGC CGCACTTCAC TGTACATTGC TTGACATTGC GAAACCGGGT 1980 
TGGGCTCGAA CGTGGGATGG GTTATCGCTC ATCGCTACAC GCCGTTGCTC CATCATAATG 2040 



-86- 



TTAATGGACA CAATGGGGCT ACGCATCCTG GTGTTTAGTC CTGGAAGACC ATCCGATAAC 2100 
CCCCGTCGGT AACACTCGCT TGTCTCGTGT CCACCCAGAC ACTACTTCAA TTCTCACTTC 2160 
TATCGTCCGC TATTACCTTG ACCTGGTCGA ACCCATCCTT ATTATTCGTT TCGACTATGC 2220 
TATATATTTA TTTTTACCAT TCGTGTCGAT CGCTCATACT CTTGGCGCTT GGGACTGGAA 2280 
GCATTTATAT TGGAAAAAAT CACGGAATGG GGCGCCTTTT CTTCTTGCAC TTCACTCGCT 2340 
GTGCATAGAC GGTTTTACAT TTCTGCTTTG CAATGCATCA CGAACTCTGC ATTAGCATAT 24 00 
AGAAAGAGGG GAAGGATGGA CCTTCTTCTT GATTGCTCGC ATGGTTTATC CATTCGCTCA 24 60 
AAGTGGATTA CGTCCACATA TTACCCGGGG GCTATACACA TGGCTACTGT GTTGCTTTCT 2520 
GACATTCGCC GGACGTGCAA GGTTGGGAGG AGAGTCTGAC GCTGACGGGG CTTGTTGAAG 2580 
GATGTTCACG CGTCCCGATT TGACCCGGCT TCGACTAACC TCAGATTCTC GACTTGTTGG 2640 
ACGGTGACTT GACTTGCTTG CTATGGTCTG ACGCTCTCAC ACCTACCTAT CACATCCTCC 2700 
TCACCTCACA AATTCCGCTC ATGGACACTA TCCTCTTCTT TTCGTTTCCC TTGGATAGTG 2760 
TGTGTGTGTG TGTGGTTGGG GCAAATTATC CATAGCAGCA GTATTATTAG TTATAATCCG 2820 
GTAGTGTTAT GATTTATGAA GGCAACTTGT ATACTA 2856 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 439 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION:. SEQ ID NO: 4: 
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Met Asn Gin Thr Leu Pro Thr Trp Lys Asp Arg Thr Gin Asn Gin 
15 10 15 

Phe Gly Lys Leu Gin lie Gin Val Pro Trp Arg Ser lie Gin Leu 

20 25 30 

Leu Val Pro His Arg Met Arg Arg Lys Leu Arg Ser Lys Leu Arg 

35 40 45 

Ser Arg Ala Ser Pro Thr Ser Ser lie Ala Ser Leu Gin Thr Ser 

50 55 60 

Leu Ser Pro Ala Asp Thr Leu Arg Ser Leu Gin Ser His Arg Trp 

65 70 75 

Thr Val Tyr Asp Phe Gin Tyr Leu Leu Leu Leu lie Val Gly lie 

80 85 90 

Phe Ser Leu Thr Val lie Glu Ser Pro Gly Pro Leu Gly Lys Thr 

95 100 105 

Ala lie Phe Ser Met Leu Leu Phe Ser Leu Leu lie Pro Met Thr 

110 115 120 

Arg Gin Phe Phe Leu Pro Phe Leu Pro lie Ala Gly Trp Leu Leu 

125 130 135 

Phe Phe Tyr Ala Cys Gin Phe lie Pro Ser Asp Trp Arg Pro Ala 

140 145 150 

lie Trp Val Arg Val Leu Pro Ala Leu Glu Asn lie Leu Tyr Gly 

155 160 165 

Ala Asn lie Ser Asn lie Leu Ser Ala His Gin Asn Val Val Leu 
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170 175 180 

Asp Val Leu Ala Trp Leu Pro Tyr Gly lie Cys His Tyr Gly Ala 

185 190 195 

Pro Phe Val Cys Ser Leu lie Met Phe lie Phe Gly Pro Pro Gly 

200 205 210 

Thr Val Pro Leu Phe Ala Arg Thr Phe Gly Tyr lie Ser Met Thr 

215 220 225 

Ala Val Thr lie Gin Leu Phe Phe Pro Cys Ser Pro Pro Trp Tyr 

230 235 240 

Glu Asn Arg Tyr Gly Leu Ala Pro Ala Asp Tyr Ser lie Gin Gly 

245 250 255 

Asp Pro Ala Gly Leu Ala Arg lie Asp Lys Leu Phe Gly lie Asp 

260 265 270 

Leu Tyr Thr Ser Gly Phe His Gin Ser Pro Val Val Phe Gly Ala 

275 280 285 

Phe Pro Ser Leu His Ala Ala Asp Ser Thr Leu Ala Ala Leu Phe 

290 295 300 

Met Ser His Val Phe Pro Arg Met Lys Pro Val Phe Val Thr Tyr 

305 310 315 

Thr Leu Trp Met Trp Trp Ala Thr Met Tyr Leu Ser His His Tyr 

320 325 330 

Ala Val Asp Leu Val Ala Gly Gly Leu Leu Ala Ala He Ala Phe 

335 340 345 
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Tyr Phe Ala Lys Thr Arg Phe Leu Pro Arg Val Gin Leu Asp Lys 

350 355 360 

Thr Phe Arg Trp Asp Tyr Asp Tyr Val Glu Phe Gly Glu Ser Ala 

365 370 375 

Leu Glu Tyr Gly Tyr Gly Ala Ala Gly Tyr Asp Gly Asp Phe Asn 

380 385 390 

Leu Asp Ser Asp Glu Trp Thr Val Gly Ser Ser Ser Ser Val Ser 

395 400 405 

Ser Gly Ser Leu Ser Pro Val Asp Asp His Tyr Ser Trp Glu Thr 

410 415 420 

Glu Ala Leu Thr Ser Pro His Thr Asp lie Glu Ser Gly Arg His 

425 430 435 

Thr Phe Ser Pro 



INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 436 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
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Met Asn Thr Thr Leu Pro Ser Trp Lys Asp Arg Thr Gin Asn Gin 
15 10 15 

Phe Gly Lys Leu Gin lie Gin Val Pro Trp Arg Thr lie Gin Leu 

20 25 30 

Leu Val Pro His Arg Met Arg Arg Lys lie Arg Ser Lys Leu Arg 

35 40 45 

Ser Arg lie Ser Pro Thr Ser Ser lie Ser Ser Leu Gin Thr Ser 

50 55 60 

Phe Ser Pro Val Asp Thr Leu Arg Ser Leu Gin Ser His Arg Trp 

65 70 75 

Thr Leu Tyr Asp Phe Gin Tyr Leu Leu Leu Leu lie Val Gly lie 

80 85 90 

Phe Ser Leu Ser Val Met Glu Ser Pro Gly Pro Leu Ala Lys Thr 

95 100 105 

Ala Ala Phe Thr Leu Leu Leu Val Ser Leu Leu Leu Pro lie Thr 

110 115 120 

Arg Gin Phe Phe Leu Pro Phe Leu Pro lie Ala Gly Trp Leu lie 

125 130 135 

Phe Phe Tyr Ala Cys Gin Phe lie Pro Ser Asp Trp Arg Pro Ala 

140 145 150 

lie Trp Val Arg Val Leu Pro Ala Leu Glu Asn lie Leu Tyr Gly 

155 160 165. 

Ala Asn lie Ser Asn lie Leu Ser Ala His Gin Asn Val Val Leu 
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170 175 180 

Asp Val Leu Ala Trp Leu Pro Tyr Gly lie Cys His Tyr Gly Ala 

185 190 195 

Pro Phe Val Cys Ser Ala lie Met Phe lie Phe Gly Pro Pro Gly 

200 205 210 

Thr Val Pro Leu Phe Ala Arg Thr Phe Gly Tyr lie Ser Met Ala 

215 220 225 

Ala Val Thr lie Gin Leu Phe Phe Pro Cys Ser Pro Pro Trp Tyr 

230 235 240 

Glu Asn Leu Tyr Gly Leu Ala Pro Ala Asp Tyr Ser Met Pro Gly 

245 250 255 

Asn Pro Ala Gly Leu Ala Arg lie Asp Glu Leu Phe Gly lie Asp 

260 265 270 

Leu Tyr Thr Ser Gly Phe Arg Gin Ser Pro Val Val Phe Gly Ala 

275 280 285 

Phe Pro Ser Leu His Ala Ala Asp Ser Thr Leu Ala Ala Leu Phe 

290 295 300 

Met Ser Gin Val Phe Pro Arg Leu Lys Pro Leu Phe Val lie Tyr 

305 310 315 

Thr Leu Trp Met Trp Trp Ala Thr Met Tyr Leu Ser His His Tyr 

320 325 330 

Ala Val Asp Leu Val Gly Gly Gly Leu Leu Ala Thr Val Ala Phe 

335 340 345 
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Tyr Phe Ala Lys Thr Arg Phe Met Pro Arg Val Gin Asn Asp Lys 

350 355 360 

Met Phe Arg Trp Asp Tyr Asp Tyr Val Glu Tyr Gly Asp Ser Ala 

365 370 375 

Leu Asp Tyr Gly Tyr Gly Pro Ala Ser Phe Glu Gly Glu Phe Asn 

380 385 390 

Leu Asp Ser Asp Glu Trp Thr Val Gly Ser Ser Ser Ser lie Ser 

395 400 405 

Ser Gly Ser Leu Ser Pro Val Asp Asp His Tyr Ser Trp Glu Gly 

410 415 420 

Glu Thr Leu Ala Ser Pro Ala Thr Asp lie Glu Ser Gly Arg His 

425 430 435 

Phe 

INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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Met Ala Asn Pro Phe 
1 5 
Asn Cys His Val Ala 

20 

Thr Leu Leu Lys Val 

35 

Val His Tyr lie Phe 

50 

Thr Asn Pro Ala Pro 

65 

Leu Gly Thr Leu Phe 

80 

Asn Ala Leu Pro lie 

95 

Ser Tyr Phe Pro Asp 

110 

Leu Pro Ala Val Glu 

125 

lie Leu Ala Thr Ser 

140 

Leu Pro Tyr Gly Leu 

155 

Ala He Leu Phe Val 



Ser Arg Trp Phe Leu 

10 

Asp Leu Glu Thr Ser 

25 

Gin Lys Tyr Lys Pro 

40 

Leu Gly Ser He Met 

55 

Trp He Phe Lys He 

70 

He He Pro Ala Thr 

85 

Leu Thr Trp Val Ala 

100 

Asp Arg Arg Pro Pro 

115 

Thr He Leu Tyr Gly 

130 

Thr Asn Ser Phe Leu 

145 

Phe His Phe Gly Ala 

160 

Phe Gly Pro Pro Thr 



Ser Glu Arg Pro Pro 

15 

Leu Asp Pro His Gin 

30 

Ala Leu Ser Asp Trp 

45 

Leu Phe Val Phe He 

60 

Leu Phe Tyr Cys Phe 

75 

Ser Gin Phe Phe Phe 

90 

Leu Tyr Phe Thr Ser 

105 

He Thr Val Lys Val 

120 

Asp Asn Leu Ser Asp 

135 

Asp He Leu Ala Trp 

150 

Pro Phe Val Val Ala 

165 

Val Leu Gin Gly Tyr 
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170 175 180 

Ala Phe Ala Phe Gly Tyr Met Asn Leu Phe Gly Val lie Met Gin 

185 190 195 

Asn Val Phe Pro Ala Ala Pro Pro Trp Tyr Lys lie Leu Tyr Gly 

200 205 210 

Leu Gin Ser Ala Asn Tyr Asp Met His Gly Ser Pro Gly Gly Leu 

215 220 225 

Ala Arg lie Asp Lys Leu Leu Gly lie Asn Met Tyr Thr Thr Ala 

230 235 240 

Phe Ser Asn Ser Ser Val lie Phe Gly Ala Phe Pro Ser Leu His 

245 250 255 

Ser Gly Cys Ala Thr Met Glu Ala Leu Phe Phe Cys Tyr Cys Phe 

260 265 270 

Pro Lys Leu Lys Pro Leu Phe lie Ala Tyr Val Cys Trp Leu Trp 

275 280 285 

Trp Ser Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu Met 

290 295 300 

Ala Gly Ser Val Leu Ser Tyr Val lie Phe Gin Tyr Thr Lys Tyr 

305 310 315 

Thr His Leu Pro lie Val Asp Thr Ser Leu Phe Cys Arg Trp Ser 

320 325 330 

Tyr Thr Ser lie Glu Lys Tyr Asp lie Ser Lys Ser Asp Pro Leu 

335 340 345 
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Ala Ala Asp Ser Asn Asp lie Glu Ser Val Pro Leu Ser Asn Leu 

350 355 360 

Glu Leu Asp Phe Asp Leu Asn Met Thr Asp Glu Pro Ser Val Ser 

365 370 375 

Pro Ser Leu Phe Asp Gly Ser Thr Ser Val Ser Arg Ser Ser Ala 

380 385 390 

Thr Ser He Thr Ser Leu Gly Val Lys Arg Ala 

395 400 



INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Met Ser Ala Leu Ser Thr Leu Lys Lys Arg Leu Ala Ala Cys Asn 

1 5 10 15 

Arg Ala Ser Gin Tyr Lys Leu Glu Thr Ser Leu Asn Pro Met Pro 

20 25 30 
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Thr Phe Arg Leu Leu Arg Asn Thr Lys Trp Ser Trp Thr His Leu 

35 40 45 

Gin Tyr Val Phe Leu Ala Gly Asn Leu lie Phe Ala Cys lie Val 

50 55 60 

He Glu Ser Pro Gly Phe Trp Gly Lys Phe Gly He Ala Cys Leu 

65 70 75 

Leu Ala He Ala Leu Thr Val Pro Leu Thr Arg Gin He Phe Phe 

80 85 90 

Pro Ala He Val He He Thr Trp Ala He Leu Phe Tyr Ser Cys 

95 100 105 

Arg Phe He Pro Glu Arg Trp Arg Pro Pro He Trp Val Arg Val 

110 115 120 

Leu Pro Thr Leu Glu Asn He Leu Tyr Gly Ser Asn Leu Ser Ser 

125 130 135 

Leu Leu Ser Lys Thr Thr His Ser He Leu Asp He Leu Ala Trp 

140 145 150 

Val Pro Tyr Gly Val Met His Tyr Ser Ala Pro Phe He He Ser 

155 160 165 

Phe He Leu Phe He Phe Ala Pro Pro Gly Thr Leu Pro Val Trp 

170 175 180 

Ala Arg Thr Phe Gly Tyr Met Asn Leu Phe Gly Val Leu He Gin 

185 190 195 

Met Ala Phe Pro Cys Ser Pro Pro Trp Tyr Glu Asn Met Tyr Gly 
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200 

Leu Glu Pro Ala Thr 

215 

Ala Arg lie Asp Ala 

230 

Phe Ser Asn Ser Pro 

245 

Ala Gly Trp Ala Met 

260 

Pro Arg Tyr Arg Phe 

275 

Trp Cys Thr Met Tyr 

290 

Gly Gly Met Cys Leu 

305 

Leu Arg Leu Pro Gin 

320 

Tyr Glu Phe Val He 

335 

Asn Ser Leu Ala Arg 

350 

Ser Phe Thr Gin Asn 

365 



205 

Tyr Ala Val Arg Gly 

220 

Leu Phe Gly Thr Ser 

235 

Val Val Phe Gly Ala 

250 

Leu Glu Ala Leu Phe 

265 

Cys Phe Tyr Gly Tyr 

280 

Leu Thr His His Tyr 

295 

Ala He He Cys Phe 

310 

Leu Gin Thr Gly Lys 

325 

His Gly His Gly Leu 

340 

Thr Gly Ser Pro Tyr 

355 

Pro Asn Ala Val Ala 

370 



210 

Ser Pro Gly Gly Leu 

225 

He Tyr Thr Asp Gly 

240 

Phe Pro Ser Leu His 

255 

Leu Ser His Val Phe 

270 

Val Leu Trp Leu Cys 

285 

Phe Val Asp Leu Val 

300 

Val Phe Ala Gin Lys 

315 

He Leu Arg Trp Glu 

330 

Ser Glu Lys Thr Ser 

345 

Leu Leu Gly Arg Asp 

360 

Phe Met Ser Gly Leu 

375 
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Asn Asn Met Glu Leu Ala Asn Thr Asp His Glu Trp Ser Val Gly 

380 385 390 

Ser Ser Ser Pro Glu Pro Leu Pro Ser Pro Ala Ala Asp Leu lie 

395 400 405 

Asp Arg Pro Ala Ser Thr Thr Ser Ser lie Phe Asp Ala Ser His 

410 415 420 

Leu Pro 



INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Met Ala Ser Ser lie Leu Arg Ser Lys lie lie Gin Lys Pro Tyr 

15 10 15 

Gin Leu Phe His Tyr Tyr Phe Leu Ser Glu Lys Ala Pro Gly Ser 

20 25 30 

Thr Val Ser Asp Leu Asn Phe Asp Thr Asn lie Gin Thr Ser Leu 

35 40 45 
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Arg Lys Leu Lys His His His Trp Thr Val Gly Glu lie Phe His 

50 55 60 

Tyr Gly Phe Leu Val Ser He Leu Phe Phe Val Phe Val Val Phe 

65 '70 75 

Pro Ala Ser Phe Phe He Lys Leu Pro He He Leu Ala Phe Ala 

80 85 90 

Thr Cys Phe Leu He Pro Leu Thr Ser Gin Phe Phe Leu Pro Ala 

95 100 105 

Leu Pro Val Phe Thr Trp Leu Ala Leu Tyr Phe Thr Cys Ala Lys 

110 115 120 

He Pro Gin Glu Trp Lys Pro Ala lie Thr Val Lys Val Leu Pro 

125 130 135 

Ala Met Glu Thr He Leu Tyr Gly Asp Asn Leu' Ser Asn Val Leu 

140 145 150 

Ala Thr He Thr Thr Gly Val Leu Asp He Leu Ala Trp Leu Pro 

155 160 165 

Tyr Gly He He His Phe Ser Phe Pro Phe Val Leu Ala Ala He 

170 175 180 

He Phe Leu Phe Gly Pro Pro Thr Ala Leu Arg Ser Phe Gly Phe 

185 190 195 

Ala Phe Gly Tyr Met Asn Leu Leu Gly Val Leu He Gin Met Ala 

200 205 210 

Phe Pro Ala Ala Pro Pro Trp Tyr Lys Asn Leu His Gly Leu Glu 
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215 220 225 

Pro Ala Asn Tyr Ser Met His Gly Ser Pro Gly Gly Leu Gly Arg 

230 235 240 

lie Asp Lys Leu Leu Gly Val Asp Met Tyr Thr Thr Gly Phe Ser 

245 250 255 

Asn Ser Ser lie lie Phe Gly Ala Phe Pro Ser Leu His Ser Gly 

260 265 270 

Cys Cys lie Met Glu Val Leu Phe Leu Cys Trp Leu Phe Pro Arg 

275 280 285 

Phe Lys Phe Val Trp Val Thr Tyr Ala Ser Trp Leu Trp Trp Ser 

290 295 300 

Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu lie Gly Gly 

305 310 315 

Ala Met Leu Ser Leu Thr Val Phe Glu Phe Thr Lys Tyr Lys Tyr 

320 325 330 

Leu Pro Lys Asn Lys Glu Gly Leu Phe Cys Arg Trp Ser Tyr Thr 

335 340 345 

Glu lie Glu Lys lie Asp lie Gin Glu lie Asp Pro Leu Ser Tyr 

350 355 360 

Asn Tyr lie Pro Val Asn Ser Asn Asp Asn Glu Ser Arg Leu Tyr 

365 370 375 

Thr Arg Val Tyr Gin Glu Ser Gin Val Ser Pro Pro Gin Arg Ala 

380 385 390 
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Glu Thr Pro Glu Ala Phe Glu Met Ser Asn Phe Ser Arg Ser Arg 

395 400 405 

Gin Ser Ser Lys Thr Gin Val Pro Leu Ser Asn Leu Thr Asn Asn 

410 415 420 

Asp Gin Val Ser Gly lie Asn Glu Glu Asp Glu Glu Glu Glu Gly 

425 430 435 

Asp Glu lie Ser Ser Ser Thr Pro Ser Val Phe Glu Asp Glu Pro 

440 445 450 

Gin Gly Ser Thr Tyr Ala Ala Ser Ser Ala Thr Ser Val Asp Asp 

455 460 465 

Leu Asp Ser Lys Arg Asn 

470 

INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 
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(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: /note= "Xaa at position 3 is Val or He" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: /note= "Xaa at position 7 is Leu or Val" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Leu Asp Xaa Leu Ala Trp Xaa Pro Tyr Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Phe Gly Ala Phe Pro Ser Leu His 
1 5 



-103- 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: /note= "Xaa at position 5 is Ser or Thr" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: /note= "Xaa at position 9 is Ala or Phe" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Thr Met Tyr Leu Xaa His His Tyr Xaa Val Asp Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



ATTTTCTTCC CCATAACAAC TCTTCTCGCC CTTCCTCCGG CTCCGTGGCC AAATTGTTTT 60 

ATGCAGCGCC TCCTAGCGAT TTAACCTCGT TCTCGTTGCC CTTGCCTGTC CGCCTTGCGT 120 

CAGTACGACC CTTGCAACGT GACCTTCCCC AGAGTATCCT CGTTTGGCCG CTGGAGACCG 180 

GAGCTTGCAC CCTCATAAAC TAGCTCTTCG AAATCAATTC TCCGTTCTCC AGAGATTATC 240 

GGATCGAATC TCTCCGCTGT CGACACCTTT CGTCTCTCGG TGATCCTCGC CCTTGGAGTC 300 

TCGTCACGTT GACGCCTTGA ACCCCTGGCC GCCAACTCCA CATAGGAGAC CACACTTCAT 360 

TCTTCCCCCG CCATAATTGC AGCACCCTCC GTCTCCCTTC GAGCTCCTCC TGGATCATCA 420 

AGTCCGAAAG GATTAGACTC GTCGCAGCGA TGAATACCAC CCTTCCATCC TGGAAGGATC 480 

GGACGCAAAA CCAGTTCGGC AAGCTCCAGA TCCAAGTCCC ATGGCGCACC ATACAACTTC 54 0 

TCGTGCCGCA CCGTATGCGA CGGAAGATTC GGTCCAAGCT GCGCAGTCGG ATCTCGCCTA 600 

CCTCATCGAT ATCCTCGTTG CAGACGTCAT TCTCACCTGT CGATACACTC AGGTCGCTGC 660 

AAAGTCATAG ATGGACGCTC TATGACTTTC AGTATCTTTT GCTGCTGATT GTCGGCATAT 720 

TCTCGCTGAG CGTTATGGAA TCACCTGGAC CATTGGCAAA GACCGCCGCG TTTACGCTAC 780 

TTCTCGTCTC TCTCCTTCTC CCGATTACGC GCCAGTTCTT CTTGCCATTC CTCCCGATTG 84 0 

CAGGATGGCT TATATTTTTC TACGCTTGCC AGTTCATCCC GAGCGACTGG CGCCCTGCAA 900 
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TCTGGGTTCG CGTGCTGCCG GCTCTGGAAA ACATTCTCTA CGGTGCTAAT ATCAGTAACA 960 
TCCTTTCCGC TCACCAAAAT GTGGTGCTTG ACGTTTTGGC GTGGCTTCCC TACGGAATCT 1020 
GCCATTATGG CGCGCCATTT GTGTGCTCAG CGATCATGTT CATCTTTGGT CCTCCCGGCA 1080 
CCGTCCCCCT TTTCGCTCGA ACTTTTGGAT ACATCAGCAT GGCTGCAGTC ACCATTCAGC 1140 
TGTTTTTCCC CTGCTCTCCT CCGTGGTACG AAAATCTGTA TGGTTTGGCT CCGGCTGATT 1200 
ACTCCATGCC GGGTAATCCT GCGGGCCTTG CTCGCATCGA TGAGCTTTTT GGGATAGACT 1260 
TGTACACATC GGGCTTCAGA CAATCTCCCG TCGTGTTTGG CGCATTTCCT TCCCTACATG 1320 
CCGCTGATTC GACACTTGCA GCTCTATTTA TGAGCCAAGT GTTCCCACGG TTGAAGCCCT 1380 
TGTTTGTCAT CTATACTCTC TGGATGTGGT GGGCTACAAT GTATCTTTCG CACCACTACG 14 4 0 
CTGTTGATCT GGTCGGTGGT GGCCTCTTGG CAACTGTCGC GTTCTACTTT GCTAAAACGC 1500 
GGTTCATGCC TCGCGTCCAG AATGATAAGA TGTTCCGCTG GGACTACGAT TATGTTGAGT 1560 
ACGGCGATTC CGCACTCGAC TATGGGTACG GTCCAGCCAG CTTCGAAGGC GAATTCAACC 1620 
TTGATAGCGA TGAGTGGACC GTTGGTTCTT CGTCATCCAT TTCGTCCGGC TCCCTCAGTC 1680 
CAGTTGACGA CCACTACTCT TGGGAGGGCG AGACTCTTGC CTCTCCTGCC ACCGACATCG 17 4 0 
AGTCTGGAAG GCATTTCTGA TCCTGCTCAA TGAGCCTTGA TACGTACTAC ACTGTGTACG 1800 
TGCTACTGCA TTGACTAATG AGACGGCGTT TTCAAACAAA TTTTAACGAC ATGCTTGGTT 1860 
ATCGCATTGA GCTGATTTCG ACACATATAT ATGTTTAATA CGTTTTGGGG ACACTCCAGG 1920 
GAT T CAT G AC GGTTGCTTCA TATCCCGACC TGGGGATGGA TTGACCTGGT TGTGCCCAAT 1980 
TTTCTTCTGC CTAACGTTTT GATTATACAT GCATTTTTCA CGAAACCAGC CGGCCCGCCA 204 0 
TGATCGTGAC CTCAATTTGA GCTCGAATCT TCCTGGGGCC TCCAGCGATA ATTCTTAATG 2100 
CTCGTTCCGA GGGTGCCACA TCGGACATTC GCTTGTACAA CTTTTGCAGA ACGAACATTT 2160 
TCACCGATTC CAACTTGAGT CATTCGCTTA CTACTTTCAA CTGGTCGAGA AACTTCGCTT 2220 
CTTTTCAGCT CGGCTAGGTG CATAAATATT TTACATTCGT GTCGATCGCT CACATTTCAC 2280 
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GGCGCCTGGA AACTTGGGGG TTTCGATTTC ATTGGAAAGG ATAAACAACA TGGGCTGGGC 2340 
GCCTTTTACA CGCACTACAT TCGCTTAGAA AAGTTCTGAT GCTTTTAATG ATTCTTGCAT 24 00 
TGGCATATAG AAAGGGGTCC TCCAGACTCG CTACTGTGGT CCTCTCTCAA CCCCACACTC 24 60 
GCTTGCTTTT AACAGTGGAC ACCCCGTGGA GCTACGTCTC CATCAAATAT TTGGCATCAA 2520 
CCGGAATCGA TGCCAGGAGG ACTGAGCTTA CTCACGGTGA CCGTCGGGTA AAAGGCGTTC 2580 
ATACAGAACA TCTCCTCTAT CCTCCTGTCC CATCTCGATT CTCTGGCTGC TGGACGCAAC 264 0 
ACACCTCGCT GCACGTTTTC GACTTCCTAA TACGACCTAA TATCATTCTC GGTTTTCTTT 27 00 
GCTCTGGCTC GCGGCCGCCA TTTATATGGC ■ GTGTCGGCTC GAGTCTGAAG TGAACTTTTT 27 60 
TCTCCTTTCT GGCCTCCACA ACTTTCCGAT CCCTAGCAGC TTCCCGTGCA CAGCGAGGTG 2820 
TTGTTGGATG ATTGTTCCAT AGCATTATCA TTATTCCTAA TCCGGTAGCG TTATGATTTA 2880 
TGAAGAACAG TGATGTACAT TATTATGCGG TGATCAAAAA AAAAAAAAAA AAAAA 2935 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2856 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(iv) ANTI-SENSE: Yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TAGTATACAA GTTGCCTTCA TAAATCATAA CACTACCGGA TTATAACTAA TAATACTGCT 60 
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GCTATGGATA ATTTGCCCCA ACCACACACA CACACACACT ATCCAAGGGA AACGAAAAGA 120 

AGAGGATAGT GTCCATGAGC GGAATTTGTG AGGTGAGGAG GATGTGATAG GTAGGTGTGA 180 

GAGCGTCAGA CCATAGCAAG CAAGTCAAGT CACCGTCCAA CAAGTCGAGA ATCTGAGGTT 24 0 

AGTCGAAGCC GGGTCAAATC GGGACGCGTG AACATCCTTC AACAAGCCCC GTCAGCGTCA 300 

GACTCTCCTC CCAACCTTGC ACGTCCGGCG AATGTCAGAA AGCAACACAG TAGCCATGTG 360 

TATAGCCCCC GGGTAATATG TGGACGTAAT CCACTTTGAG CGAATGGATA AACCATGCGA 420 

GCAATCAAGA AGAAGGTCCA TCCTTCCCCT CTTTCTATAT GCTAATGCAG AGTTCGTGAT 480 

GCATTGCAAA GCAGAAATGT AAAACCGTCT ATGCACAGCG AGTGAAGTGC AAGAAGAAAA 54 0 

GGCGCCCCAT TCCGTGATTT TTTCCAATAT AAATGCTTCC AGTCCCAAGC GCCAAGAGTA 600 

TGAGCGATCG ACACGAATGG TAAAAATAAA TATATAGCAT AGTCGAAACG AATAATAAGG 660 

ATGGGTTCGA CCAGGTCAAG GTAATAGCGG ACGATAGAAG TGAGAATTGA AGTAGTGTCT 720 

GGGTGGACAC GAGACAAGCG AGTGTTACCG ACGGGGGTTA TCGGATGGTC TTCCAGGACT 780 

AAACACCAGG ATGCGTAGCC CCATTGTGTC CATTAACATT ATGATGGAGC AACGGCGTGT 84 0 

AGCGATGAGC GATAACCCAT CCCACGTTCG AGCCCAACCC GGTTTCGCAA TGTCAAGCAA 900 

TGTACAGTGA AGTGCGGCGG AAAGAGAAAC AGTATCCCAG GCGGATTCGG ACCGAGGTAC 960 

ATTAAGGTCG AACCGATGAT CTGGGACACC GAAGAAAGGG TATTAAAGAT ATATGTGTCG 1020 

AAATGTGCTC GATGCAATAA AGCCAGAATG TCGTGAAATG TTCTAACGCC GTCTCGTAAA 1080 

TAGATGCAGT GGAGGAGGAG CGAGATCGCT ATATGCAGGT ATCGAGTTTG GTTTGTGGCT 114 0 

ACTCAAGGGC TGAAAGTATG CCTGCCGGAC TCAATATCAG TATGTGGGGA GGTCAGTGCC 1200 

TCGGTTTCCC ATGAGTAATG ATCGTCAACG GGACTCAAGG AGCCTGAGGA GACGGAGGAT 1260 

GAAGAACCAA CAGTCCATTC ATCGCTGTCG AGATTGAAGT CTCCATCATA GCCAGCTGCA 1320 

CCATACCCAT ACTCCAGGGC AGACTCGCCG AATTCCACAT AGTCGTAGTC CCAACGGAAG 1380 

GTCTTGTCGA GCTGGACACG GGGAAGGAAT CGGGTCTTGG CGAAGTAGAA AGCAATGGCG 14 4 0 
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GCCAGGAGAC CACCCGCAAC CAAATCGACC GCATAGTGAT GTGAGAGGTA CATTGTTGCC 1500 
CACCACATCC ATAGAGTATA GGTCACGAAG ACGGGCTTCA TGCGGGGGAA AACATGACTC 1560 
ATGAAAAGTG CGGCCAGGGT TGAGTCGGCA GCATGCAGCG ACGGAAAAGC GCCGAACACA 1620 
ACAGGCGACT GATGGAAAAC AGACGTGTAA AGGTCGATGC CGAAAAGCTT GTCAATGCGG 1680 
GCAAGCCCTG CGGGATCACC TTGGATGGAG TAGTCTGCCG GAGCTAGACC ATAGCGATTC 17 4 0 
TCATACCAAG GTGGAGAGCA AGGGAAAAAC AGCTGAATAG TAACCGCAGT CATACTGATA 1800 
TAGCCGAAAG TGCGCGCGAA AAGGGGAACA GTGCCGGGCG GACCGAAGAT GAACATGATC 18 60 
AACGAGCACA CAAACGGAGC GCCATAGTGG CAGATACCGT AGGGTAGCCA CGCCAGCACG 1920 
TCAAGCACAA CGTTCTGGTG AGCGGATAGG ATGTTGCTGA TGTTTGCGCC GTAGAGAATA 198 0 
TTCTCCAGTG CAGGCAAGAC ACGAACCCAA ATCGCAGGGC GCCAATCGCT TGGGATGAAC 204 0 
TGGCAGGCGT AGAAAAACAG AAGCCATCCG GCAATCGGCA GAAACGGGAG GAAGAACTGG 2100 
CGGGTCATAG GGATCAGGAG AGAGAATAGG AGCATGGAGA AAATGGCCGT TTTGCCCAAA 2160 
GGCCCGGGCG ACTCGATAAC GGTCAAAGAG AAGATGCCCA CGATCAACAG AAGCAGATAT 2220 
TGGAAGTCGT AAACCGTCCA TCGGTGGCTT TGGAGCGATC GTAGTGTGTC TGCAGGCGAT 2280 
AACGACGTCT GTAAAGAGGC TATTGACGAG GTAGGAGACG CTCTACTGCG CAATTTGGAC 234 0 
CTTAACTTCC GCCGCATGCG ATGCGGGACG AGCAGTTGGA TGGACCGCCA TGGAACCTGG 24 00 
ATCTGAAGCT TTCCAAACTG GTTCTGCGTG CGGTCCTTCC ACGTGGGAAG TGTTTGATTC 24 60 
ATGGTTGCGG CGAGTCTTGC GGGCAAGACT CAGCTATAAT TCCTCCAGAT CGAACTCCGT 2520 
CTGCTTTCTT TCCCCCCGAT TGTACAAGAG TTGGAAGGCG ACGAATAAAT CGGTCGCGGA 2580 
AGCGGCAGAA AAGGGGTCTT GAATAACAAG CCGTTAGGCA ACGAGCAGGG TGCTGACGGC 264 0 
GTCAACCCGA GCGGGGAAAT TGAGATCGGT TCGAAGAAGA CGAAACGAAG CGCACGCTCC 27 00 
GGTCTCCAGC GGCCAAACGA GGATGTCTTT GAGCCGTCAC GTTGCAGGGG GTTTCACTGA 27 60 
TGCAAGGCGA AGTGCAAGGG CGCAGCAGTC GAGAGGAAAA CCTGGAACGA GGAGGTCGTG 2820 
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AGGGAGGCAG ATGGCCACGG AGCCGGAGTA TAAACC 2856 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2856 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA 
(iv) ANTI-SENSE: Yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

UAGUAUACAA GUUGCCUUCA UAAAUCAUAA CACUACCGGA UUAUAACUAA UAAUACUGCU 60 

GCUAUGGAUA AUUUGCCCCA ACCACACACA CACACACACU AUCCAAGGGA AACGAAAAGA 120 

AGAGGAUAGU GUCCAUGAGC GGAAUUUGUG AGGUGAGGAG GAUGUGAUAG GUAGGUGUGA 180 

GAGCGUCAGA CCAUAGCAAG CAAGUCAAGU CACCGUCCAA CAAGUCGAGA AUCUGAGGUU 24 0 

AGUCGAAGCC GGGUCAAAUC GGGACGCGUG AACAUCCUUC AACAAGCCCC GUCAGCGUCA 300 

GACUCUCCUC CCAACCUUGC ACGUCCGGCG AAUGUCAGAA AGCAACACAG UAGCCAUGUG 360 

UAUAGCCCCC GGGUAAUAUG UGGACGUAAU CCACUUUGAG CGAAUGGAUA AACCAUGCGA 42 0 

GCAAUCAAGA AGAAGGUCCA UCCUUCCCCU CUUUCUAUAU GCUAAUGCAG AGUUCGUGAU 480 

GCAUUGCAAA GCAGAAAUGU AAAACCGUCU AUGCACAGCG AGUGAAGUGC AAGAAGAAAA 54 0 

GGCGCCCCAU UCCGUGAUUU UUUCCAAUAU AAAUGCUUCC AGUCCCAAGC GCCAAGAGUA 600 

UGAGCGAUCG ACACGAAUGG UAAAAAUAAA UAUAUAGCAU AGUCGAAACG AAUAAUAAGG 660 
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AUGGGUUCGA CCAGGUCAAG GUAAUAGCGG ACGAUAGAAG UGAGAAUUGA AGUAGUGUCU 720 
GGGUGGACAC GAGACAAGCG AGUGUUACCG ACGGGGGUUA UCGGAUGGUC UUCCAGGACU 780 
AAACACCAGG AUGCGUAGCC CCAUUGUGUC CAUUAACAUU AUGAUGGAGC AACGGCGUGU 84 0 
AGCGAUGAGC GAUAACCCAU CCCACGUUCG AGCCCAACCC GGUUUCGCAA UGUCAAGCAA 900 
UGUACAGUGA AGUGCGGCGG AAAGAGAAAC AGUAUCCCAG GCGGAUUCGG ACCGAGGUAC 960 
AUUAAGGUCG AACCGAUGAU CUGGGACACC GAAGAAAGGG UAUUAAAGAU AUAUGUGUCG 1020 
AAAUGUGCUC GAUGCAAUAA AGCCAGAAUG UCGUGAAAUG UUCUAACGCC GUCUCGUAAA 108 0 
UAGAUGCAGU GGAGGAGGAG CGAGAUCGCU AUAUGCAGGU AUCGAGUUUG GUUUGUGGCU 114 0 
ACUCAAGGGC UGAAAGUAUG CCUGCCGGAC UCAAUAUCAG UAUGUGGGGA GGUCAGUGCC 1200 
UCGGUUUCCC AUGAGUAAUG AUCGUCAACG GGACUCAAGG AGCCUGAGGA GACGGAGGAO 1260 
GAAGAACCAA CAGUCCAUUC AUCGCUGUCG AGAUUGAAGU CUCCAUCAUA GCCAGCUGCA 1320 
CCAUACCCAU ACUCCAGGGC AGACUCGCCG AAUUCCACAU AGUCGUAGUC CCAACGGAAG 1380 
GUCUUGUCGA GCUGGACACG GGGAAGGAAU CGGGUCUUGG CGAAGUAGAA AGCAAUGGCG 14 4 0 
GCCAGGAGAC CACCCGCAAC CAAAUCGACC GCAUAGUGAU GUGAGAGGUA CAUUGUUGCC 1500 
CACCACAUCC AUAGAGUAUA GGUCACGAAG ACGGGCUUCA UGCGGGGGAA AACAUGACUC 1560 
AUGAAAAGUG CGGCCAGGGU UGAGUCGGCA GCAUGCAGCG ACGGAAAAGC GCCGAACACA 1620 
ACAGGCGACU GAUGGAAAAC AGACGUGUAA AGGUCGAUGC CGAAAAGCUU GUCAAUGCGG 1680 
GCAAGCCCUG CGGGAUCACC UUGGAUGGAG UAGUCUGCCG GAGCUAGACC AUAGCGAUUC 17 4 0 
UCAUACCAAG GUGGAGAGCA AGGGAAAAAC AGCUGAAUAG UAACCGCAGU CAUACUGAUA 1800 
UAGCCGAAAG UGCGCGCGAA AAGGGGAACA GUGCCGGGCG GACCGAAGAU GAACAUGAUC 1860 
AACGAGCACA CAAACGGAGC GCCAUAGUGG CAGAUACCGU AGGGUAGCCA CGCCAGCACG 1920 
UCAAGCACAA CGUUCUGGUG AGCGGAUAGG AUGUUGCUGA UGUUUGCGCC GUAGAGAAUA 1980 
UUCUCCAGUG CAGGCAAGAC ACGAACCCAA AUCGCAGGGC GCCAAUCGCU UGGGAUGAAC 204 0 
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UGGCAGGCGU AGAAAAACAG AAGCCAUCCG GCAAUCGGCA GAAACGGGAG GAAGAACUGG 2100 
CGGGUCAUAG GGAUCAGGAG AGAGAAUAGG AGCAUGGAGA AAAUGGCCGU UUUGCCCAAA 2160 
GGCCCGGGCG ACUCGAUAAC GGUCAAAGAG AAGAUGCCCA CGAUCAACAG AAGCAGAUAU 2220 
UGGAAGUCGU AAACCGUCCA UCGGUGGCUU UGGAGCGAUC GUAGUGUGUC UGCAGGCGAU 2280 
AACGACGUCU GUAAAGAGGC UAUUGACGAG GUAGGAGACG CUCUACUGCG CAAUUUGGAC 234 0 
CUUAACUUCC GCCGCAUGCG AUGCGGGACG AGCAGUUGGA UGGACCGCCA UGGAACCUGG 24 00 
AUCUGAAGCU UUCCAAACUG GUUCUGCGUG CGGUCCUUCC ACGUGGGAAG UGUUUGAUUC 24 60 
AUGGUUGCGG CGAGUCUUGC GGGCAAGACU CAGCUAUAAU UCCUCCAGAU CGAACUCCGU 2520 
CUGCUUUCUU UCCCCCCGAU UGUACAAGAG UUGGAAGGCG ACGAAUAAAU CGGUCGCGGA 2580 
AGCGGCAGAA AAGGGGUCUU GAAUAACAAG CCGUUAGGCA ACGAGCAGGG UGCUGACGGC 264 0 
GUCAACCCGA GCGGGGAAAU UGAGAUCGGU UCGAAGAAGA CGAAACGAAG CGCACGCUCC 2700 
GGUCUCCAGC GGCCAAACGA GGAUGUCUUU GAGCCGUCAC GUUGCAGGGG GUUUCACUGA 27 60 
UGCAAGGCGA AGUGCAAGGG CGCAGCAGUC GAGAGGAAAA CCUGGAACGA GGAGGUCGUG 2820 
AGGGAGGCAG AUGGCCACGG AGCCGGAGUA UAAACC 2856 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2385 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
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AAGCTTTTTT GCCTCTGCAA AAGTTCCTTT CTCGAATTGG TTTTTTGAGG AAAAGCAAGT 60 

TAATAAACTA ATTATATTAT ATATAATTAG CAATTTTATA AAAAAAATAA AAAAATAGCC 120 

CTGATTGCTG GCAACTGTGA GCTGAACATT GGTTAATCGG TCCATCTTTT TTTAAATATT 180 

TTACATCGCT ACTTTTAAGT GCTTGACACT TGCATTTAAT AGCTACTTTC TTTCCTTCAT 24 0 

AAAAATTCCT TTTTTTTCCT TTAGTTTTCC GGTTAATTCC TTACGAAATT TTTTTCGTAC 300 

GCTTCCCTTT TTTACTCTGA TAATTCTTTG AAGCAATGTC TGCTCTTTCG ACCTTAAAAA 360 

AGCGCCTTGC TGCGTGTAAC CGAGCATCCC AATACAAGTT GGAAACAAGC TTAAACCCTA 4 20 

TGCCTACATT TCGTTTGCTA CGCAATACGA AATGGTCATG GACACATTTG CAATATGTGT 4 80 

TTCTAGCAGG TAATTTGATT TTTGCTTGTA TTGTCATTGA ATCTCCTGGA TTCTGGGGGA 54 0 

AATTTGGCAT TGCCTGTCTT TTGGCCATTG CGTTGACCGT TCCTTTAACA CGCCAAATTT 600 

TTTTTCCTGC CATTGTTATC ATCACCTGGG CAATTTTATT TTACTCTTGT AGGTTTATTC 660 

CAGAACGCTG GCGTCCACCC ATATGGGTTC GTGTTTTACC CACACTTGAA AATATTCTTT 720 

ATGGCTCTAA TCTTTCTAGT CTTCTCTCGA AAACCACGCA TAGCATCCTT GATATTTTGG 78 0 

CCTGGGTTCC ATATGGAGTC ATGCATTATT CGGCTCCTTT TATCATTTCA TTTATTCTTT 84 0 

TCATCTTTGC ACCTCCTGGA ACTCTTCCAG TTTGGGCTCG AACTTTTGGT TATATGAATT 900 

TATTTGGTGT TCTTATCCAA ATGGCTTTCC CCTGTTCTCC TCCTTGGTAT GAAAATATGT 960 

ATGGTTTAGA ACCTGCCACG TATGCAGTAC GTGGCTCTCC TGGTGGATTG GCCCGTATTG 1020 

ATGCTCTCTT CGGCACTAGC ATTTACACTG ATTGTTTTTC TAACTCTCCG GTTGTTTTTG 1080 

GTGCCTTTCC ATCTCTTCAC GCTGGATGGG CCATGCTGGA AGCACTTTTC CTTTCGCATG 114 0 

TGTTTCCTCG ATACCGCTTC TGCTTTTATG GATATGTTCT ATGGCTTTGC TGGTGTACTA 1200 

TGTACCTTAC CCACCACTAC TTTGTAGATT TGGTCGGCGG TATGTGTTTA GCTATTATAT 12 60 

GCTTCGTTTT TGCTCAAAAG CTACGCCTCC CACAGTTGCA AACTGGTAAA ATCCTTCGTT 1320 
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GGGAATACGA GTTTGTTATC CACGGTCATG GTCTTTCCGA AAAAACCAGC AACTCCTTGG 1380 

CTCGTACCGG CAGCCCATAC TTACTTGGAA GGGATTCTTT TACTCAAAAC CCTAATGCAG 14 40 

TAGCCTTCAT GAGTGGTCTT AACAATATGG AACTTGCTAA CACCGATCAT GAATGGTCCG 1500 

TGGGTTCATC ATCACCTGAG CCGTTACCTA GTCCTGCTGC TGATTTGATT GATCGTCCTG 1560 

CCAGTACCAC TTCCTCCATC TTTGATGCAA GTCATCTTCC TTAAATCAAC GTGCTTTAAG 1620 

AATATATTTC CAAAAGCTAC ATGATACATT GACTAGAATC GGTTTGATTC ATAGTGGTAT 1680 

TGGAATGATG TTGTTCATTG TGTTTTTTAA CTGTTAATCT GACATCCATT GAGTCATTCT 17 40 

TTACAATTTG TAAAATTAAT TTGTATCACT AATTTTGAAG GAAGCTATTT TGGTATTAAT 1800 

ACCGCTTTTG GTCTCCACTT CCTTTTCGAA ACTCTTAACA GCGATTAGGC CGGGTATCTT 1860 

CCAGTGTGAT GTATAGGTAT TTGTCGTTTT TTTATCATTT CCGTTAATAA AGAACTCTTT 1920 

TATCCAGCTT CTTACACTGT CAACTGTTGT GAAAGGAACA CATTTAGAAT TTCATTTTCC 1980 

TTATTTGTTG TGATTTAAAT CGTTTGACAT AATTTTAAAT TTGGTTTGAA ATGTGTGTGA 204 0 

GAAGGCTTGT TTTATTCATT TAGTTTATTG CTTGTTTGCA CGAAAATCCA GAACGGAGCA 2100 

TTAATGTAAT CCTTTTTTAT TCTGTAAAGC GTTTTTATAC AAATGTTGGT TATACGTTTC 2160 

TAAAATAAGA ATATTGTTAT AATAATATAG TTTTTTCTAT CATTTGTTAC ACACACTAAA 2220 

GAGACATTAA GGATAAGCAA ATGTGTTAAA ATGATAATAT ATTTTGGAAA CATTTATAAA 2280 

GAAATTAAGC AGCTTTGACT AACTACATTT TTGTTTTTTT CCTAAGCAAA ACTGTATAGT 2340 

TATACACGCG AGCTGTATTC ACTTCCATTG TAGTGACTTG AGCTC 2385 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Met Ser Ala Leu Ser Thr Leu Lys Lys Arg Leu Ala Ala Cys Asn 
15 10 15 

Arg Ala Ser Gin Tyr Lys Leu Glu Thr Ser Leu Asn Pro Met Pro 

20 25 30 

Thr Phe Arg Leu Leu Arg Asn Thr Lys' Trp Ser Trp Thr His Leu 

35 40 45 

Gin Tyr Val Phe Leu Ala Gly Asn Leu lie Phe Ala Cys lie Val 

50 55 60 

lie Glu Ser Pro Gly Phe Trp Gly Lys Phe Gly lie Ala Cys Leu 

65 70 75 

Leu Ala lie Ala Leu Thr Val Pro Leu Thr Arg Gin lie Phe Phe 

80 85 90 

Pro Ala lie Val lie lie Thr Trp Ala lie Leu Phe Tyr Ser Cys 

95 100 105 

Arg Phe lie Pro Glu Arg Trp Arg Pro Pro lie Trp Val Arg Val 

110 115 120 

Leu Pro Thr Leu Glu Asn lie Leu Tyr Gly Ser Asn Leu Ser Ser 

125 130 135 
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Leu Leu Ser Lys Thr 

140 

Val Pro Tyr Gly Val 

155 

Phe He Leu Phe He 

170 

Ala Arg Thr Phe Gly 

185 

Met Ala Phe Pro Cys 

200 

Leu Glu Pro Ala Thr 

215 

Ala Arg He Asp Ala 

230 

Phe Ser Asn Ser Pro 

245 

Ala Gly Trp Ala Met 

260 

Pro Arg Tyr Arg Phe 

275 

Trp Cys Thr Met Tyr 

290 

Gly Gly Met Cys Leu 



Thr His Ser He Leu 

145 

Met His Tyr Ser Ala 

160 

Phe Ala Pro Pro Gly 

175 

Tyr Met Asn Leu Phe 

190 

Ser Pro Pro Trp Tyr 

205 

Tyr Ala Val Arg Gly 

220 

Leu Phe Gly Thr Ser 

235 

Val Val Phe Gly Ala 

250 

Leu Glu Ala Leu Phe 

265 

Cys Phe Tyr Gly Tyr 

280 

Leu Thr His His Tyr 

295 

Ala He He Cys Phe 



Asp He Leu Ala Trp 

150 

Pro Phe He He Ser 

165 

Thr Leu Pro Val Trp 

180 

Gly Val Leu He Gin 

195 

Glu Asn Met Tyr Gly 

210 

Ser Pro Gly Gly Leu 

225 

He Tyr Thr Asp Cys 

240 

Phe Pro Ser Leu His 

255 

Leu Ser His Val Phe 

270 

Val Leu Trp Leu Cys 

285 

Phe Val Asp Leu Val 

300 

Val Phe Ala Gin Lys 
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305 310 315 

Leu Arg Leu Pro Gin Leu Gin Thr Gly Lys lie Leu Arg Trp Glu 

320 325 330 

Tyr Glu Phe Val He His Gly His Gly Leu Ser Glu Lys Thr Ser 

335 340 345 

Asn Ser Leu Ala Arg Thr Gly Ser Pro Tyr Leu Leu Gly Arg Asp 

350 355 360 

Ser Phe Thr Gin Asn Pro Asn Ala Val Ala Phe Met Ser Gly Leu 

365 370 375 

Asn Asn Met Glu Leu Ala Asn Thr Asp His Glu Trp Ser Val Gly 

380 385 390 

Ser Ser Ser Pro Glu Pro Leu Pro Ser Pro Ala Ala Asp Leu He 

395 400 405 

Asp Arg Pro Ala Ser Thr Thr Ser Ser He Phe Asp Ala Ser His 

410 415 420 

Leu Pro 



INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2385 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



AAGCTTTTTT GCCTCTGCAA AAGTTCCTTT CTCGAATTGG TTTTTTGAGG AAAAGCAAGT 60 

TAATAAACTA ATTATATTAT ATATAATTAG CAATTTTATA AAAAAAATAA AAAAATAGCC 120 

CTGATTGCTG GCAACTGTGA GCTGAACATT GGTTAATCGG TCCATCTTTT TTTAAATATT 180 

TTACATCGCT ACTTTTAAGT GCTTGACACT TGCATTTAAT AGCTACTTTC TTTCCTTCAT 24 0 

AAAAATTCCT TTTTTTTCCT TTAGTTTTCC GGTTAATTCC TTACGAAATT TTTTTCGTAC 300 

GCTTCCCTTT TTTACTCTGA TAATTCTTTG AAGCAATGTC TGCTCTTTCG ACCTTAAAAA 360 

AGCGCCTTGC TGCGTGTAAC CGAGCATCCC AATACAAGTT GGAAACAAGC TTAAACCCTA 420 

TGCCTACATT TCGTTTGCTA CGCAATACGA AATGGTCATG GACACATTTG CAATATGTGT 4 80 

TTCTAGCAGG TAATTTGATT TTTGCTTGTA TTGTCATTGA ATCTCCTGGA TTCTGGGGGA 54 0 
AATTTGGCAT TGCCTGTCTT TTGGCCATTG CGTTGACCGT TCCTTTAACA CGCCAAATTT 600 
TTTTTCCTGC CATTGTTATC ATCACCTGGG CAATTTTATT TTACTCTTGT AGGTTTATTC 660 

CAGAACGCTG GCGTCCACCC ATATGGGTTC GTGTTTTACC CACACTTGAA AATATTCTTT 720 

ATGGCTCTAA TCTTTCTAGT CTTCTCTCGA AAACCACGCA TAGCATCCTT GATATTTTGG 7 80 

CCTGGGTTCC ATATGGAGTC ATGCATTATT CGGCTCCTTT TATCATTTCA TTTATTCTTT 840 
TCATCTTTGC ACCTCCTGGA ACTCTTCCAG TTTGGGCTCG AACTTTTGGT TATATGAATT 900 
TATTTGGTGT TCTTATCCAA ATGGCTTTCC CCTGTTCTCC TCCTTGGTAT GAAAATATGT 960 

ATGGTTTAGA ACCTGCCACG TATGCAGTAC GTGGCTCTCC TGGTGGATTG GCCCGTATTG 1020 

ATGCTCTCTT CGGCACTAGC ATTTACACTG ATGGTTTTTC TAACTCTCCG GTTGTTTTTG 1080 

GTGCCTTTCC ATCTCTTCAC GCTGGATGGG CCATGCTGGA AGCACTTTTC CTTTCGCATG 114 0 

TGTTTCCTCG ATACCGCTTC TGCTTTTATG GATATGTTCT ATGGCTTTGC TGGTGTACTA 1200 
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TGTACCTTAC CCACCACTAC TTTGTAGATT TGGTCGGCGG TATGTGTTTA GCTATTATAT 1260 
GCTTCGTTTT TGCTCAAAAG CTACGCCTCC CACAGTTGCA AACTGGTAAA ATCCTTCGTT 1320 
GGGAATACGA GTTTGTTATC CACGGTCATG GTCTTTCCGA AAAAACCAGC AACTCCTTGG 1380 
CTCGTACCGG CAGCCCATAC TTACTTGGAA GGGATTCTTT TACTCAAAAC CCTAATGCAG 144 0 
TAGCCTTCAT GAGTGGTCTT AACAATATGG AACTTGCTAA CACCGATCAT GAATGGTCCG 1500 
TGGGTTCATC ATCACCTGAG CCGTTACCTA GTCCTGCTGC TGATTTGATT GATCGTCCTG 1560 
CCAGTACCAC TTCCTCCATC TTTGATGCAA GTCATCTTCC TTAAATCAAC GTGCTTTAAG 1620 
AATATATTTC CAAAAGCTAC ATGATACATT GACTAGAATC GGTTTGATTC ATAGTGGTAT 1680 
TGGAATGATG TTGTTCATTG TGTTTTTTAA CTGTTAATCT GACATCCATT GAGTCATTCT 17 4 0 
TTACAATTTG TAAAATTAAT TTGTATCACT AATTTTGAAG GAAGCTATTT TGGTATTAAT 1800 
ACCGCTTTTG GTCTCCACTT CCTTTTCGAA ACTCTTAACA GCGATTAGGC CGGGTATCTT 1860 
CCAGTGTGAT GTATAGGTAT TTGTCGTTTT TTTATCATTT CCGTTAATAA AGAACTCTTT 1920 
TATCCAGCTT CTTACACTGT CAACTGTTGT GAAAGGAACA CATTTAGAAT TTCATTTTCC 1980 
TTATTTGTTG TGATTTAAAT CGTTTGACAT AATTTTAAAT TTGGTTTGAA ATGTGTGTGA 204 0 
GAAGGCTTGT TTTATTCATT TAGTTTATTG CTTGTTTGCA CGAAAATCCA GAACGGAGCA 210 0 
TTAATGTAAT CCTTTTTTAT TCTGTAAAGC GTTTTTATAC AAATGTTGGT TATACGTTTC 2160 
TAAAATAAGA ATATTGTTAT AATAATATAG TTTTTTCTAT CATTTGTTAC ACACACTAAA 222 0 
GAGACATTAA GGATAAGCAA ATGTGTTAAA ATGATAATAT ATTTTGGAAA CATTTATAAA 2280 
GAAATTAAGC AGCTTTGACT AACTACATTT TTGTTTTTTT CCTAAGCAAA ACTGTATAGT 234 0 
TATACACGCG AGCTGTATTC ACTTCCATTG TAGTGACTTG AGCTC 2385 



(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Ser Ala Leu Ser Thr Leu Lys Lys Arg Leu Ala Ala Cys Asn 

15 10 15 

Arg Ala Ser Gin Tyr Lys Leu Glu Thr Ser Leu Asn Pro Met Pro 

20 25 30 

Thr Phe Arg Leu Leu Arg Asn Thr Lys Trp Ser Trp Thr His Leu 

35 40 45 

Gin Tyr Val Phe Leu Ala Gly Asn Leu lie Phe Ala Cys lie Val 

50 55 60 

lie Glu Ser Pro Gly Phe Trp Gly Lys Phe Gly lie Ala Cys Leu 

65 70 75 

Leu Ala lie Ala Leu Thr Val Pro Leu Thr Arg Gin lie Phe Phe 

80 85 90 

Pro Ala lie Val lie lie Thr Trp Ala He Leu Phe Tyr Ser Cys 

95 100 105 

Arg Phe He Pro Glu Arg Trp Arg Pro Pro He Trp Val Arg Val 

110 115 120 
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Leu Pro Thr Leu Glu 

125 

Leu Leu Ser Lys Thr 

140 

Val Pro Tyr Gly Val 

155 

Phe lie Leu Phe lie 

170 

Ala Arg Thr Phe Gly 

185 

Met Ala Phe Pro Cys 

200 

Leu Glu Pro Ala Thr 

215 

Ala Arg lie Asp Ala 

230 

Phe Ser Asn Ser Pro 

245 

Ala Gly Trp Ala Met 

260 

Pro Arg Tyr Arg Phe 

275 

Trp Cys Thr Met Tyr 



Asn lie Leu Tyr Gly 

130 

Thr His Ser lie Leu 

145 

Met His Tyr Ser Ala 

160 

Phe Ala Pro Pro Gly 

175 

Tyr Met Asn Leu Phe 

190 

Ser Pro Pro Trp Tyr 

205 

Tyr Ala Val Arg Gly 

220 

Leu Phe Gly Thr Ser 

235 

Val Val Phe Gly Ala 

250 

Leu Glu Ala Leu Phe 

265 

Cys Phe Tyr Gly Tyr 

280 

Leu Thr His His Tyr 



Ser Asn Leu Ser Ser 

135 

Asp lie Leu Ala Trp 

150 

Pro Phe lie lie Ser 

165 

Thr Leu Pro Val Trp 

180 

Gly Val Leu He Gin 

195 

Glu Asn Met Tyr Gly 

210 

Ser Pro Gly Gly Leu 

225 

He Tyr Thr Asp Gly 

240 

Phe Pro Ser Leu His 

255 

Leu Ser His Val Phe 

270 

Val Leu Trp Leu Cys 

285 

Phe Val Asp Leu Val 
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290 295 300 

Gly Gly Met Cys Leu Ala He He Cys Phe Val Phe Ala Gin Lys 

305 310 315 

Leu Arg Leu Pro Gin Leu Gin Thr Gly Lys He Leu Arg Trp Glu 

320 325 330 

Tyr Glu Phe Val He His Gly His Gly Leu Ser Glu Lys Thr Ser 

335 340 345 

Asn Ser Leu Ala Arg Thr Gly Ser Pro Tyr Leu Leu Gly Arg Asp 

350 355 360 

Ser Phe Thr Gin Asn Pro Asn Ala Val Ala Phe Met Ser Gly Leu 

365 370 375 

Asn Asn Met Glu Leu Ala Asn Thr Asp His Glu Trp Ser Val Gly 

380 385 390 

Ser Ser Ser Pro Glu Pro Leu Pro Ser Pro Ala Ala Asp Leu He 

395 400 405 

Asp Arg Pro Ala Ser Thr Thr Ser Ser lie Phe Asp Ala Ser His 

410 415 420 

Leu Pro 



INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2340 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



TTTCTTTCTG TCAAAGAATA ATAAAGTGCC CATCAGTGTT CATATTTGTT ACAAAGTGGT 60 

TTTCTGATTT GGTACTACTG CAGAGGCGTA TTTTTTGCTT CAGTTACCAT AGCGTAAGAA 120 

CACTAGCGAC TTTTGTTCGT GAACCAACAG AGTAGGATTT CTACTGCTAC ATCTCTTAGG 18 0 

TAGTTGGTTA GTCCGATCGC TCACTTTTGG TTGTTGTTAA GTACTTCATA AGTTTATCCT 24 0 

TTTCCTTTTT CACACTGAGC TACTTTGGGT ATAGCTTTTG GCCCAAGGAT CTTTGAATTT 300 

TCTCCAAAAG TACTTTATTT TATATCCTAC AGGTTGCGGT TTTCATATTT TAAAAAGCTT 360 

TTTAATCATT CCTTTGCGTA TGGCAAACCC TTTTTCGAGA TGGTTTCTAT CAGAGAGACC 420 

TCCAAACTGC CATGTAGCCG ATTTAGAAAC AAGTTTAGAT CCCCATCAAA CGTTGTTGAA 480 

GGTGCAAAAA TACAAACCCG CTTTAAGCGA CTGGGTGCAT TACATCTTCT TGGGATCCAT 54 0 

CATGCTGTTT GTGTTCATTA CTAATCCCGC ACCTTGGATC TTCAAGATCC TTTTTTATTG 600 

TTTCTTGGGC ACTTTATTCA TCATTCCAGC TACGTCACAG TTTTTCTTCA ATGCCTTGCC 660 

CATCCTAACA TGGGTGGCGC TGTATTTCAC TTCATCGTAC TTTCCAGATG ACCGCAGGCC 720 

TCCTATTACT GTCAAAGTGT TACCAGCGGT GGAAACAATT TTATACGGCG ACAATTTAAG 78 0 

TGATATTCTT GCAACATCGA CGAATTCCTT TTTGGACATT TTAGCATGGT TACCGTACGG 84 0 

ACTATTTCAT TATGGGGCCC CATTTGTCGT TGCTGCCATC TTATTCGTAT TTGGTCCACC 900 

AACTGTTTTG CAAGGTTATG CTTTTGCATT TGGTTATATG AACCTGTTTG GTGTTATCAT 960 

GCAAAATGTC TTTCCAGCCG CTCCCCCATG GTATAAAATT CTCTATGGAT TGCAATCAGC 1020 

CAACTATGAT ATGCATGGCT CGCCTGGTGG ATTAGCTAGA ATTGATAAGC TACTCGGTAT 1080 
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TAATATGTAT ACTACAGCTT TTTCAAATTC CTCCGTCATT 
GCATTCCGGG TGTGCTACTA TGGAAGCCCT GTTTTTCTGT 
GCCCTTGTTT ATTGCTTATG TTTGCTGGTT ATGGTGGTCA 
TTATTTTGTA GACCTTATGG CAGGTTCTGT GCTGTCATAC 
GTACACACAT TTACCAATTG TAGATACATC TCTTTTTTGC 
TGAGAAATAC GATATATCAA AGAGTGATCC ATTGGCTGCA 
TGTCCCTTTG TCCAACTTGG AACTTGACTT TGATCTTAAT 
AAGCCCTTCG TTATTTGATG GATCTACTTC TGTTTCTCGT 
GTCACTAGGT GTAAAGAGGG CTTAATGAGT ATTTTATCTG 
CTTATGTAGA TACATATAAA TATATATCTT TTTCTTTCTT 
TTTAGCATAA TATACATGTA GTTTATTTAA TCACATACCA 
ATAAATTTTT GAAATAAATG GGTGGCTTTT AATGGTGTCT 
AATGCTCTTC CTGCTTTGTT TATTATATGT GTATGAAAGA 
TTTGTAGCGT CCCCAGTCAA AACCTGTGCG CTATACCTAA 
CACTAATTCT AAAATAGACT TCTTCCCCAA AGAACGGTGT 
CTGCTTATCT AAATCAACTT TAACGATGGA TGATCTTATG 
AGTTCTTAGA ATTTCAGACT GTACCGCAGC TGATGAATCA 
TGCTCGAAAA TGTTTTTCCT GGTCTTTCTT CATTATTTTA 
GGGTACAATG TCCCTCACCA CACCTCTGTT TTGAATAATC 
CAATTCTTTT GCTTCCAACT TTGGCGCATT GGAGTTGGTT 
CTCATAAAGC ATCTTAGTGA AAAGGGTGGT TTTGCGTTAT 

(2) INFORMATION FOR SEQ ID NO: 20: 



TTCGGTGCTT TTCCTTCACT 114 0 
TATTGTTTTC CAAAATTGAA 1200 
ACTATGTATC TGACACACCA 12 60 
GTTATTTTCC AGTACACAAA 1320 
AGATGGTCAT ACACTTCAAT 1380 
GATTCAAACG ATATCGAAAG 14 40 
ATGACTGATG AACCCAGTGT 1500 
TCGTCCGCCA CGTCTATAAC 1560 
CAATTACGGA TACGGTTGGT 1620 
TTTCTTAGTC AGGATTGTCG 1680 
CTGATTATCT TTAGAATTTT 17 4 0 
ATGTTAAGTG AGGCTTTTAG 1800 
TATGTATGTA TTTACATGTG 18 60 
ATGGATTGAT AATCTTCATT 1920 
AACGATGAGG CTCTATCCAG 1980 
ACACGGGGAT CTTTCTTTAA 2040 
AACAGCATTA AAAAGTGATA 2100 
GGAAGATACC TTATGCCCAT 2160 
AGTTTCCCGA TTGTGGAAGA 2220 
ATGCGAACAA GTCCGATCAG 2280 
TCTTTCCTCT GTTGAAGCTT 234 0 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



Met Ala Asn Pro Phe Ser Arg Trp Phe Leu Ser Glu Arg Pro Pro 
15 10 15 

Asn Cys His Val Ala Asp Leu Glu Thr Ser Leu Asp Pro His Gin 

20 25 30 

Thr Leu Leu Lys Val Gin Lys Tyr Lys Pro Ala Leu Ser Asp Trp 

35 40 45 

Val His Tyr lie Phe Leu Gly Ser lie Met Leu Phe Val Phe lie 

50 55 60 

Thr Asn Pro Ala Pro Trp lie Phe Lys lie Leu Phe Tyr Cys Phe 

65 70 75 

Leu Gly Thr Leu Phe lie lie Pro Ala Thr Ser Gin Phe Phe Phe 

80 85 90 

Asn Ala Leu Pro lie Leu Thr Trp Val Ala Leu Tyr Phe Thr Ser 

95 100 105 

Ser Tyr Phe Pro Asp Asp Arg Arg Pro Pro lie Thr Val Lys Val 
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110 115 120 

Leu Pro Ala Val Glu Thr lie Leu Tyr Gly Asp Asn Leu Ser Asp 

125 130 135 

lie Leu Ala Thr Ser Thr Asn Ser Phe Leu Asp lie Leu Ala Trp 

140 145 150 

Leu Pro Tyr Gly Leu Phe His Tyr Gly Ala Pro Phe Val Val Ala 

155 160 165 

Ala He Leu Phe Val Phe Gly Pro Pro Thr Val Leu Gin Gly Tyr 

170 175 180 

Ala Phe Ala Phe Gly Tyr Met Asn Leu Phe Gly Val He Met Gin 

185 190 195 

Asn Val Phe Pro Ala Ala Pro Pro Trp Tyr Lys He Leu Tyr Gly 

200 205 210 

Leu Gin Ser Ala Asn Tyr Asp Met His Gly Ser Pro Gly Gly Leu 

215 220 225 

Ala Arg He Asp Lys Leu Leu Gly He Asn Met Tyr Thr Thr Ala 

230 235 240 

Phe Ser Asn Ser Ser Val He Phe Gly Ala Phe Pro Ser Leu His 

245 250 255 

Ser Gly Cys Ala Thr Met Glu Ala Leu Phe Phe Cys Tyr Cys Phe 

260 265 270 

Pro Lys Leu Lys Pro Leu Phe He Ala Tyr Val Cys Trp Leu Trp 

275 280 285 
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Trp Ser Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu Met 

290 295 300 

Ala Gly Ser Val Leu Ser Tyr Val lie Phe Gin Tyr Thr Lys Tyr 

305 310 315 

Thr His Leu Pro lie Val Asp Thr Ser Leu Phe Cys Arg Trp Ser 

320 325 330 

Tyr Thr Ser lie Glu Lys Tyr Asp lie Ser Lys Ser Asp Pro Leu 

335 340 345 

Ala Ala Asp Ser Asn Asp lie Glu Ser Val Pro Leu Ser Asn Leu 

350 355 360 

Glu Leu Asp Phe Asp Leu Asn Met Thr Asp Glu Pro Ser Val Ser 

365 370 375 

Pro Ser Leu Phe Asp Gly Ser Thr Ser Val Ser Arg Ser Ser Ala 

380 385 390 

Thr Ser lie Thr Ser Leu Gly Val Lys Arg Ala 

395 400 



INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: genomic 
(xi) SEQUENCE DESCRIPTION: ! 

TTTCTTTCTG TCAAAGAATA ATAAAGTGCC 
TTTCTGATTT GGTACTACTG CAGAGGCGTA 
CACTAGCGAC TTTTGTTCGT GAACCAACAG 
TAGTTGGTTA GTCCGATCGC TCACTTTTGG 
TTTCCTTTTT CACACTGAGC TACTTTGGGT 
TCTCCAAAAG TACTTTATTT TATATCCTAC 
TTTAATCATT CCTTTGCGTA TGGCAAACCC 
TCCAAACTGC CATGTAGCCG ATTTAGAAAC 
GGTGCAAAAA TACAAACCCG CTTTAAGCGA 
CATGCTGTTT GTGTTCATTA CTAATCCCGC 
TTTCTTGGGC ACTTTATTCA TCATTCCAGC 
CATCCTAACA TGGGTGGCGC TGTATTTCAC 
TCCTATTACT GTCAAAGTGT TACCAGCGGT 
TGATATTCTT GCAACATCGA CGAATTCCTT 
ACTATTTCAT TTTGGGGCCC CATTTGTCGT 
AACTGTTTTG CAAGGTTATG CTTTTGCATT 
GCAAAATGTC TTTCCAGCCG CTCCCCCATG 
CAACTATGAT ATGCATGGCT CGCCTGGTGG 
TAATATGTAT ACTACAGCTT TTTCAAATTC 
GCATTCCGGG TGTGCTACTA TGGAAGCCCT 



DNA 

EQ ID NO:21: 

CATCAGTGTT CATATTTGTT ACAAAGTGGT 60 

TTTTTTGCTT CAGTTACCAT AGCGTAAGAA 120 

AGTAGGATTT CTACTGCTAC ATCTCTTAGG 180 

TTGTTGTTAA GTACTTCATA AGTTTATCCT 24 0 

ATAGCTTTTG GCCCAAGGAT CTTTGAATTT 300 

AGGTTGCGGT TTTCATATTT TAAAAAGCTT 360 

TTTTTCGAGA TGGTTTCTAT CAGAGAGACC 420 

AAGTTTAGAT CCCCATCAAA CGTTGTTGAA 4 80 

CTGGGTGCAT TACATCTTCT TGGGATCCAT 54 0 

ACCTTGGATC TTCAAGATCC TTTTTTATTG 600 

TACGTCACAG TTTTTCTTCA ATGCCTTGCC 660 

TTCATCGTAC TTTCCAGATG ACCGCAGGCC 720 

GGAAACAATT TTATACGGCG ACAATTTAAG 780 

TTTGGACATT TTAGCATGGT TACCGTACGG 840 

TGCTGCCATC TTATTCGTAT TTGGTCCACC 900 

TGGTTATATG AACCTGTTTG GTGTTATCAT 960 

GTATAAAATT CTCTATGGAT TGCAATCAGC 1020 

ATTAGCTAGA ATTGATAAGC TACTCGGTAT 1080 

CTCCGTCATT TTCGGTGCTT TTCCTTCACT 114 0 

GTTTTTCTGT TATTGTTTTC CAAAATTGAA 1200 
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GCCCTTGTTT ATTGCTTATG TTTGCTGGTT ATGGTGGTCA 
TTATTTTGTA GACCTTATGG CAGGTTCTGT GCTGTCATAC 
GTACACACAT TTACCAATTG TAGATACATC TCTTTTTTGC 
TGAGAAATAC GATATATCAA AGAGTGATCC ATTGGCTGCA 
TGTCCCTTTG TCCAACTTGG AACTTGACTT TGATCTTAAT 
AAGCCCTTCG TTATTTGATG GATCTACTTC TGTTTCTCGT 
GTCACTAGGT GTAAAGAGGG CTTAATGAGT ATTTTATCTG 
CTTATGTAGA TACATATAAA TATATATCTT TTTCTTTCTT 
TTTAGCATAA TATACATGTA GTTTATTTAA TCACATACCA 
ATAAATTTTT GAAATAAATG GGTGGCTTTT AATGGTGTCT 
AATGCTCTTC CTGCTTTGTT TATTATATGT GTATGAAAGA 
TTTGTAGCGT CCCCAGTCAA AACCTGTGCG CTATACCTAA 
CACTAATTCT AAAATAGACT TCTTCCCCAA AGAACGGTGT 
CTGCTTATCT AAATCAACTT TAACGATGGA TGATCTTATG 
AGTTCTTAGA ATTTCAGACT GTACCGCAGC TGATGAATCA 
TGCTCGAAAA TGTTTTTCCT GGTCTTTCTT CATTATTTTA 
GGGTACAATG TCCCTCACCA CACCTCTGTT TTGAATAATC 
CAATTCTTTT GCTTCCAACT TTGGCGCATT GGAGTTGGTT 
CTCATAAAGC ATCTTAGTGA AAAGGGTGGT TTTGCGTTAT 

(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 



ACTATGTATC TGACACACCA 1260 
GTTATTTTCC AGTACACAAA 1320 
AGATGGTCAT ACACTTCAAT 1380 
GATTCAAACG ATATCGAAAG 1440 
ATGACTGATG AACCCAGTGT 1500 
TCGTCCGCCA CGTCTATAAC 1560 
CAATTACGGA TACGGTTGGT 1620 
TTTCTTAGTC AGGATTGTCG 1680 
CTGATTATCT TTAGAATTTT 17 4 0 
ATGTTAAGTG AGGCTTTTAG 1800 
TATGTATGTA TTTACATGTG 18 60 
ATGGATTGAT AATCTTCATT 1920 
AACGATGAGG CTCTATCCAG 1980 
ACACGGGGAT CTTTCTTTAA 204 0 
AACAGCATTA AAAAGTGATA 2100 
GGAAGATACC TTATGCCCAT 2160 
AGTTTCCCGA TTGTGGAAGA 2220 
ATGCGAACAA GTCCGATCAG 2280 
TCTTTCCTCT GTTGAAGCTT 234 0 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Ala Asn Pro Phe Ser Arg Trp Phe Leu Ser Glu Arg Pro Pro 

15 10 15 

Asn Cys His Val Ala Asp Leu Glu Thr Ser Leu Asp Pro His Gin 

20 25 30 

Thr Leu Leu Lys Val Gin Lys Tyr Lys Pro Ala Leu Ser Asp Trp 

35 40 45 

Val His Tyr lie Phe Leu Gly Ser He Met Leu Phe Val Phe He 

50 55 60 

Thr Asn Pro Ala Pro Trp He Phe Lys He Leu Phe Tyr Cys Phe 

65 70 75 

Leu Gly Thr Leu Phe He He Pro Ala Thr Ser Gin Phe Phe Phe 

80 85 90 

Asn Ala Leu Pro He Leu Thr Trp Val Ala Leu Tyr Phe Thr Ser 

95 100 105 

Ser Tyr Phe Pro Asp Asp Arg Arg Pro Pro He Thr Val Lys Val 

110 115 120 

Leu Pro Ala Val Glu Thr He Leu Tyr Gly Asp Asn Leu Ser Asp 
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lie Leu Ala 



Leu Pro Tyr 



Ala lie Leu 



Ala Phe Ala 



Asn Val Phe 



Leu Gin Ser 



Ala Arg lie 



Phe Ser Asn 



Ser Gly Cys 



Pro Lys Leu 



Trp Ser Thr 



125 

Thr Ser Thr Asn 
140 

Gly Leu Phe His 
155 

Phe Val Phe Gly 
170 

Phe Gly Tyr Met 
185 

Pro Ala Ala Pro 
200 

Ala Asn Tyr Asp 
215 

Asp Lys Leu Leu 
230 

Ser Ser Val lie 
245 

Ala Thr Met Glu 
260 

Lys Pro Leu Phe 
275 

Met Tyr Leu Thr 
290 
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Ser Phe Leu Asp 
145 

Phe Gly Ala Pro 
160 

Pro Pro Thr Val 
175 

Asn Leu Phe Gly 
190 

Pro Trp Tyr Lys 
205 

Met His Gly Ser 
220 

Gly lie Asn Met 
235 

Phe Gly Ala Phe 
250 

Ala Leu Phe Phe 
265 

He Ala Tyr Val 
280 

His His Tyr Phe 
295 
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He Leu Ala Trp 
150 

Phe Val Val Ala 
165 

Leu Gin Gly Tyr 
180 

Val He Met Gin 
195 

He Leu Tyr Gly 
210 

Pro Gly Gly Leu 
225 

Tyr Thr Thr Ala 

240 

Pro Ser Leu His 
255 

Cys Tyr Cys Phe 
270 

Cys Trp Leu Trp 
285 

Val Asp Leu Met 
300 
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Ala Gly Ser Val Leu Ser Tyr Val lie Phe Gin Tyr Thr Lys Tyr 

305 310 315 

Thr His Leu Pro lie Val Asp Thr Ser Leu Phe Cys Arg Trp Ser 

320 325 330 

Tyr Thr Ser lie Glu Lys Tyr Asp lie Ser Lys Ser Asp Pro Leu 

335 340 345 

Ala Ala Asp Ser Asn Asp lie Glu Ser Val Pro Leu Ser Asn Leu 

350 355 360 

Glu Leu Asp Phe Asp Leu Asn Met Thr Asp Glu Pro Ser Val Ser 

365 370 375 

Pro Ser Leu Phe Asp Gly Ser Thr Ser Val Ser Arg Ser Ser Ala 

380 385 390 

Thr Ser lie Thr Ser Leu Gly Val Lys Arg Ala 

395 400 



INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
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AGCGCTTCTA TTTTCCTCCC CACCGCGAGG CGGAAATGGC ACATTTTTTT TCTTTTGCTT 60 

CTGTGCTTTT GCTGTAATTT TTGGCATGTG CTATTGTATG AAGATAACGC GTGGTTCCGT 120 

GGAAATAGCC GGAAATTTTG CCGGGAATAT GACGGACATG ATTTAACACC CGTGGAAATG 180 

AAAAAAGCCA AGGTAAGAAA GTGGCAATAT TTTTCCTACA AATAGATCTG CTGTCCCTTA 240 

GATGATTACC ATACATATAT ATATTTATTA CACACATCTG TCAGAGGTAG CTAGCGAAGG 300 

TGTCACTGAA ATATTTTTTG TTCCAGTTAG TATAAATACG GAGGTAGAAC AGCTCTCCGC 360 

GTGTATATCT TTTTTTGCGC TATACAAGAA CAGGAAGAAC GCATTTCCAT ACCTTTTTCT 4 20 

CCTTACAGGT GCCCTCTGAG TAGTGTCACG AACGAGGAAA AAGATTAATA TTACTGTTTT 4 80 

TATATTCAAA AAGAGTAAAG CCGTTGCTAT ATACGAATAT GACGATTACC GTGGGGGATG 54 0 

CAGTTTCGGA GACGGAGCTG GAAAACAAAA GTCAAAACGT GGTACTATCT CCCAAGGCAT 600 

CTGCTTCTTC AGACATAAGC ACAGATGTTG ATAAAGACAC ATCGTCTTCT TGGGATGACA 660 

AATCTTTGCT GCCTACAGGT GAATATATTG TGGACAGAAA TAAGCCCCAA ACCTACTTGA 720 

ATAGCGATGA TATCGAAAAA GTGACAGAAT CTGATATTTT CCCTCAGAAA CGTCTGTTTT 7 80 

CATTCTTGCA CTCTAAGAAA ATTCCAGAAG TACCACAAAC CGATGACGAG AGGAAGATAT 840 

ATCCTCTGTT CCATACAAAT ATTATCTCTA ACATGTTTTT TTGGTGGGTT CTACCCATCC 900 

TGCGAGTTGG TTATAAGAGA ACGATACAGC CGAACGATCT CTTCAAAATG GATCCGAGGA 960 

TGTCTATAGA GACCCTTTAT GACGACTTTG AAAAAAACAT GATTTACTAT TTTGAGAAGA 1020 

CGAGGAAAAA ATACCGTAAA AGACATCCAG AAGCGACAGA AGAAGAGGTT ATGGAAAATG 1080 

CCAAACTACC TAAACATACA GTTCTGAGAG CTTTATTATT CACTTTTAAG AAACAGTACT 114 0 

TCATGTCGAT AGTGTTTGCA ATTCTCGCTA ATTGTACATC CGGTTTTAAC CCCATGATTA 1200 

CCAAGAGGCT AATTGAGTTT GTCGAAGAAA AGGCTATTTT TCATAGCATG CATGTTAACA 1260 

AAGGTATTGG TTACGCTATT GGTGCATGTT TGATGATGTT CGTTAACGGG TTGACGTTCA 1320 

ATCATTTCTT TCATACATCC CAACTGACTG GTGTGCAAGC TAAGTCTATT CTTACTAAAG 1380 
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CTGCCATGAA GAAAATGTTT AATGCATCTA ATTATGCGAG ACATTGTTTT CCTAACGGTA 14 40 
AAGTGACTTC TTTTGTAACA ACAGATCTCG CTAGAATTGA ATTTGCCTTA TCTTTTCAGC 1500 
CGTTTTTGGC TGGGTTCCCT GCAATTTTGG CTATTTGCAT TGTTTTATTG ATCGTTAACC 1560 
TTGGACCCAT TGCCTTAGTT GGGATTGGTA TTTTTTTCGG TGGGTTTTTC ATATCCTTAT 1620 
TTGCATTTAA GTTAATTCTG GGCTTTAGAA TTGCTGCGAA CATCTTCACT GATGCTAGAG 1680 
TTACCATGAT GAGAGAAGTG CTGAATAATA TAAAAATGAT TAAATATTAT ACGTGGGAGG 174 0 
ATGCGTATGA AAAAAATATT CAAGATATTA GGACCAAAGA GATTTCTAAA GTTAGAAAAA 1800 
TGCAACTATC AAGAAATTTC TTGATTGCTA TGGCCATGTC TTTGCCTAGT ATTGCTTCAT 1860 
TGGTCACTTT CCTTGCAATG TACAAAGTTA ATAAAGGAGG CAGGCAACCT GGTAATATTT 1920 
TTGCCTCTTT ATCTTTATTT CAGGTCTTGA GTTTGCAAAT GTTTTTCTTA CCTATTGCTA 1980 
TTGGTACTGG AATTGACATG ATCATTGGAT TGGGCCGTTT GCAAAGCTTA TTGGAGGCTC 204 0 
CAGAAGATGA TCCAAATCAG ATGATTGAAA TGAAGCCCTC TCCTGGCTTT GATCCAAAAT 2100 
TGGCTCTAAA AATGACACAT TGCTCATTTG AGTGGGAAGA TTATGAATTA AACGACGCTA 2160 
TTGAAGAAGC AAAAGGAGAA GCTAAAGATG AAGGTAAAAA GAACAAAAAA AAGCGTAAGG 2220 
ATACATGGGG TAAGCCATCT GCAAGTACTA ATAAGGCGAA AAGATTGGAC AATATGTTGA 2280 
AAGACAGAGA CGGCCCGGAA GAT T TAG AAA AAACTTCGTT TAGGGGTTTC AAGGACTTGA 234 0 
ACTTCGATAT TAAAAAGGGC GAATTTATTA TGATTACGGG ACCTATTGGT ACTGGTAAAT 24 00 
CTTCATTATT GAATGCGATG GCAGGATCAA TGAGAAAAAT TGATGGTAAG GTTGAAGTCA 24 60 
ACGGGGACTT ATTAATGTGT GGTTATCCAT GGATTCAAAA TGCATCTGTA AGAGATAACA 2520 
TCATATTCGG TTCACCATTC AATAAAGAAA AGTATGATGA AGTAGTTCGT GTTTGCTCTT 258 0 
TGAAAGCTGA TCTGGATATT TTACCGGCAG GCGATATGAC CGAAATTGGG GAACGTGGTA 264 0 
TTACTTTATC TGGTGGTCAA AAGGCACGTA TCAATTTAGC CAGGTCTGTT TATAAGAAGA 2700 
AGGATATTTA TGTATTCGAC GATGTCCTAA GTGCTGTCGA TTCTCGTGTT GGTAAACACA 27 60 
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TCATGGATGA ATGTCTAACC GGAATGCTTG CTAATAAAAC CAGAATTTTA GCAACGCATC 2820 
AGTTGTCACT GATTGAGAGA GCTTCTAGAG TCATCGTTTT AGGTACTGAT GGCCAAGTCG 2880 
ATATTGGTAC TGTTGATGAG CTAAAAGCTC GTAATCAAAC TTTGATAAAT CTTTTACAAT 294 0 
TCTCTTCTCA AAATTCGGAG AAAGAGGATG AAGAACAGGA AGCGGTTGTT TCCGGTGAAT 3000 
TGGGACAACT AAAATATGAA CCAGAGGTAA AGGAATTGAC TGAACTGAAG AAAAAGGCTA 3060 
CAGAAATGTC ACAAACTGCA AATAGTGGTA AAATTGTAGC GGATGGTCAT ACTAGTAGTA 3120 
AAGAAGAAAG AGCAGTCAAT AGTATCAGTC TGAAAATATA CCGTGAATAC ATTAAAGCTG 3180 
CAGTAGGTAA GTGGGGTTTT ATCGCACTAC CGTTGTATGC AATTTTAGTC GTTGGAACCA 324 0 
CATTCTGCTC ACTTTTTTCT TCCGTTTGGT TATCTTACTG GACTGAGAAT AAATTCAAAA 3300 
ACAGACCACC CAGTTTTTAT ATGGGTCTTT ACTCCTTCTT TGTGTTTGCT GCTTTCATAT 3360 
TCATGAATGG CCAGTTCACC ATACTTTGCG CAATGGGTAT TATGGCATCG AAATGGTTAA 34 20 
ATTTGAGGGC TGTGAAAAGA ATTTTACACA CTCCAATGTC ATACATAGAT ACCACACCTT 34 80 
TGGGACGTAT TCTGAACAGA TTCACAAAAG ATACAGATAG CTTAGATAAT GAGTTAACCG 354 0 
AAAGTTTACG GTTGATGACA TCTCAATTTG CTAATATTGT AGGTGTTTGC GTCATGTGTA 3600 
TTGTTTACTT GCCGTGGTTT GCTATCGCAA TTCCGTTTCT TTTGGTCATC TTTGTTCTGA 3660 
TTGCTGATCA TTATCAGAGT TCTGGTAGAG AAATTAAAAG ACTTGAAGCT GTGCAACGGT 3720 
CTTTTGTTTA CAATAATTTA AATGAAGTTT TGGGTGGGAT GGATACAATC AAAGCATACC 37 80 
GAAGTCAGGA ACGATTTTTG GCGAAATCAG ATTTTTTGAT CAACAAGATG AATGAGGCGG 38 4 0 
GATACCTTGT AGTTGTCCTG CAAAGATGGG TAGGTATTTT CCTTGATATG GTTGCTATCG 3900 
CATTTGCACT AATTATTACG TTATTGTGTG TTACGAGAGC CTTTCCTATT TCCGCGGCTT 3960 
CAGTTGGTGT TTTGTTGACT TATGTATTAC AATTGCCTGG TCTATTAAAT ACCATTTTAA 4 020 
GGGCAATGAC TCAAACAGAG AATGACATGA ATAGTGCCGA AAGATTGGTA ACATATGCAA 4 080 
CTGAACTACC ACTAGAGGCA TCCTATAGAA AGCCCGAAAT GACACCTCCA GAGTCATGGC 414 0 
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CCTCAATGGG CGAAATAATT TTTGAAAATG TTGATTTTGC 
TAGTTTTAAA AAATCTTAAC TTGAATATCA AGAGTGGGGA 
GTACAGGTGC TGGTAAGTCC ACTATTATGA GTGCCCTTTA 
CAGGTAAAAT TTTAATTGAC AATGTTGATA TAAGTCAGCT 
GAAAATTAGC CATCATTCCA CAAGATCCAG TATTATTTAG 
TAGATCCATT TAATGAGCGT ACAGATGACG AATTATGGGA 
CTATCGCCAA GGATGACTTG CCGGAAGTGA AATTGCAAAA 
ATGGTAAAAT GCATAAGTTC CATTTAGATC AAGCAGTGGA 
CCTTAGGTGA GAGACAACTA TTAGCATTAA CAAGGGCATT 
TGATTTTGGA TGAGGCTACA TCCTCAGTGG ACTACGAAAC 
GTATTGTTGA GGAATTTGGA GATTGTACAA TTTTGTGTAT 
TTGTAAATTA TGATCGTATT CTTGTTTTAG AGAAGGGTGA 
CATGGACGTT GTTTAGTCAA GAAGATAGTA TTTTCAGAAG 
TTGTGGAAAA TGATTTCGAG AACAGAAGTT AATTTATATT 
CTTTTATTTA TTTATATGTT GCCGATGGTA CAAATTAGTA 
CTATGACTTG CAGAAAAAGT TATGTGTGCC ATAGATAGAT 
TATACTCAAA ATTCCGAAAA GAACATTTCA TTTTTTATGA 
GGTCCTTTTT TCATTCTAGA AATATATATT TATACATCAT 
ACTTATTGGG ATGTCTATTT ACTGAATAAA GTATACACAA 
GGCATAAATA GAAAACTTAG AAGTGAAAAT CCTAAAACCG 

(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 



CTATAGACCT GGTTTACCTA 4 200 
AAAAATTGGT ATCTGTGGTC 4260 
CAGGTTGAAT GAATTGACCG 4320 
GGGACTTTTC GATTTAAGAA 4380 
GGGTACGATT CGCAAGAACT 4440 
TGCATTGGTG AGAGGTGGTG 4500 
ACCTGATGAA AATGGTACTC 4 560 
AGAAGAGGGC TCCAATTTCT 4 620 
GGTCCGCCAA TCAAAAATAT 4 680 
GGATGGCAAA ATCCAAACAC 4 74 0 
TGCTCACAGA CTGAAGACCA 4 800 
AGTCGCAGAA TTCGATACAC 4 860 
CATGTGTTCT AGATCTGGTA .4920 
ATTTGTTGCA TGATTTTTCT 4 980 
CTAGAAAAGA AAACCCACTA 504 0 
ATAATTGCAT ACCCACATCG 5100 
GGCAAACTGA ACAACGCTTC 5160 
TTTCAGAAGA TATTCAAAGA 5220 
AAAACGAATT TAAAATGGAA 5280 
AAGGATATTT CAAATACGTA 5340 



-136- 



(A) LENGTH: 1477 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Met Thr He Thr Val Gly Asp Ala Val Ser Glu Thr Glu Leu Glu 

5 10 15 

Asn Lys Ser Gin Asn Val Val Leu Ser Pro Lys Ala Ser Ala Ser 

20 25 30 

Ser Asp He Ser Thr Asp Val Asp Lys Asp Thr Ser Ser Ser Trp 

35 40 45 

Asp Asp Lys Ser Leu Leu Pro Thr Gly Glu Tyr He Val Asp Arg 

50 55 60 

Asn Lys Pro Gin Thr Tyr Leu Asn Ser Asp Asp He Glu Lys Val 

65 70 75 

Thr Glu Ser Asp He Phe Pro Gin Lys Arg Leu Phe Ser Phe Leu 

80 85 90 

His Ser Lys Lys He Pro Glu Val Pro Gin Thr Asp Asp Glu Arg 

95 100 105 

Lys He Tyr Pro Leu Phe His Thr Asn He He Ser Asn Met Phe 

110 115 120 
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Phe Trp Trp Val Leu Pro lie Leu Arg Val Gly Tyr Lys Arg Thr 

125 130 135 

lie Gin Pro Asn Asp Leu Phe Lys Met Asp Pro Arg Met Ser lie 

140 145 150 

Glu Thr Leu Tyr Asp Asp Phe Glu Lys Asn Met lie Tyr Tyr Phe 

155 160 165 

Glu Lys Thr Arg Lys Lys Tyr Arg Lys Arg His Pro Glu Ala Thr 

170 175 180 

Glu Glu Glu Val Met Glu Asn Ala Lys Leu Pro Lys His Thr Val 

185 190 195 

Leu Arg Ala Leu Leu Phe Thr Phe Lys Lys Gin Tyr Phe Met Ser 

200 205 210 

He Val Phe Ala He Leu Ala Asn Cys Thr Ser Gly Phe Asn Pro 

215 220 225 

Met He Thr Lys Arg Leu He Glu Phe Val Glu Glu Lys Ala He 

230 235 240 

Phe His Ser Met His Val Asn Lys Gly He Gly Tyr Ala He Gly 

245 250 255 

Ala Cys Leu Met Met Phe Val Asn Gly Leu Thr Phe Asn His Phe 

260 265 270 

Phe His Thr Ser Gin Leu Thr Gly Val Gin Ala Lys Ser He Leu 

275 280 285 

Thr Lys Ala Ala Met Lys Lys Met Phe Asn Ala Ser Asn Tyr Ala 
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290 295 300 

Arg His Cys Phe Pro Asn Gly Lys Val Thr Ser Phe Val Thr Thr 

305 310 315 

Asp Leu Ala Arg lie Glu Phe Ala Leu Ser Phe Gin Pro Phe Leu 

320 325 330 

Ala Gly Phe Pro Ala lie Leu Ala lie Cys lie Val Leu Leu lie 

335 340 345 

Val Asn Leu Gly Pro He Ala Leu Val Gly He Gly He Phe Phe 

350 355 360 

Gly Gly Phe Phe He Ser Leu Phe Ala Phe Lys Leu He Leu Gly 

365 370 375 

Phe Arg He Ala Ala Asn He Phe Thr Asp Ala Arg Val Thr Met 

380 385 390 

Met Arg Glu Val Leu Asn Asn He Lys Met He Lys Tyr Tyr Thr 

395 400 405 

Trp Glu Asp Ala Tyr Glu Lys Asn He Gin Asp He Arg Thr Lys 

410 415 420 

Glu He Ser Lys Val Arg Lys Met Gin Leu Ser Arg Asn Phe Leu 

425 430 435 

He Ala Met Ala Met Ser Leu Pro Ser He Ala Ser Leu Val Thr 

440 445 450 

Phe Leu Ala Met Tyr Lys Val Asn Lys Gly Gly Arg Gin Pro Gly 

455 460 465 
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Asn lie Phe Ala 



Met Phe Phe Leu 



He Gly Leu Gly 



Asp Pro Asn Gin 



Pro Lys Leu Ala 



Asp Tyr Glu Leu 



Lys Asp Glu Gly 



Gly Lys Pro Ser 



Met Leu Lys Asp 



Phe Arg Gly Phe 



Phe He Met He 



Leu Asn Ala Met 



Ser Leu Ser Leu 
470 

Pro He Ala He 
485 

Arg Leu Gin Ser 
500 

Met He Glu Met 
515 

Leu Lys Met Thr 
530 

Asn Asp Ala He 
545 

Lys Lys Asn Lys 
560 

Ala Ser Thr Asn 
575 

Arg Asp Gly Pro 
590 

Lys Asp Leu Asn 
605 

Thr Gly Pro He 
620 

Ala Gly Ser Met 



Phe Gin Val 

475 
Gly Thr Gly 

490 
Leu Leu Glu 

505 
Lys Pro Ser 

520 
His Cys Ser 

535 
Glu Glu Ala 

550 
Lys Lys Arg 

565 
Lys Ala Lys 

580 

Glu Asp Leu 

595 
Phe Asp lie 

610 

Gly Thr Gly 
625 

Arg Lys He 
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Leu Ser Leu Gin 
480 

He Asp Met He 
495 

Ala Pro Glu Asp 
510 

Pro Gly Phe Asp 
525 

Phe Glu Trp Glu 
540 

Lys Gly Glu Ala 
555 

Lys Asp Thr Trp 
570 

Arg Leu Asp Asn 
585 

Glu Lys Thr Ser 
600 

Lys Lys Gly Glu 
615 

Lys Ser Ser Leu 
630 

Asp Gly Lys Val 



635 640 645 

Glu Val Asn Gly Asp Leu Leu Met Cys Gly Tyr Pro Trp lie Gin 

650 655 660 

Asn Ala Ser Val Arg Asp Asn He He Phe Gly Ser Pro Phe Asn 

665 670 675 

Lys Glu Lys Tyr Asp Glu Val Val Arg Val Cys Ser Leu Lys Ala 

680 685 690 

Asp Leu Asp He Leu Pro Ala Gly Asp Met Thr Glu He Gly Glu 

695 700 705 

Arg Gly He Thr Leu Ser Gly Gly Gin Lys Ala Arg He Asn Leu 

710 715 720 

Ala Arg Ser Val Tyr Lys Lys Lys Asp He Tyr Val Phe Asp Asp 

725 730 735 

Val Leu Ser Ala Val Asp Ser Arg Val Gly Lys His He Met Asp 

740 745 750 

Glu Cys Leu Thr Gly Met Leu Ala Asn Lys Thr Arg He Leu Ala 

755 760 765 

Thr His Gin Leu Ser Leu He Glu Arg Ala Ser Arg Val He Val 

770 775 780 

Leu Gly Thr Asp Gly Gin Val Asp He Gly Thr Val Asp Glu Leu 

785 790 795 

Lys Ala Arg Asn Gin Thr Leu He Asn Leu Leu Gin Phe Ser Ser 

800 805 810 
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Gin Asn Ser Glu Lys Glu Asp Glu Glu Gin Glu Ala Val Val Ser 

815 820 825 

Gly Glu Leu Gly Gin Leu Lys Tyr Glu Pro Glu Val Lys Glu Leu 

830 835 840 

Thr Glu Leu Lys Lys Lys Ala Thr Glu Met Ser Gin Thr Ala Asn 

845 850 855 

Ser Gly Lys lie Val Ala Asp Gly His Thr Ser Ser Lys Glu Glu 

860 865 870 

Arg Ala Val Asn Ser lie Ser Leu Lys lie Tyr Arg Glu Tyr He 

875 880 885 

Lys Ala Ala Val Gly Lys Trp Gly Phe He Ala Leu Pro Leu Tyr 

890 895 900 

Ala He Leu Val Val Gly Thr Thr Phe Cys Ser Leu Phe Ser Ser 

905 910 915 

Val Trp Leu Ser Tyr Trp Thr Glu Asn Lys Phe Lys Asn Arg Pro 

920 925 930 

Pro Ser Phe Tyr Met Gly Leu Tyr Ser Phe Phe Val Phe Ala Ala 

935 940 945 

Phe He Phe Met Asn Gly Gin Phe Thr He Leu Cys Ala Met Gly 

950 955 960 

He Met Ala Ser Lys Trp Leu Asn Leu Arg Ala Val Lys Arg He 

965 970 975 

Leu His Thr Pro Met Ser Tyr He Asp Thr Thr Pro Leu Gly Arg 
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980 985 990 

lie Leu Asn Arg Phe Thr Lys Asp Thr Asp Ser Leu Asp Asn Glu 

995 1000 1005 

Leu Thr Glu Ser Leu Arg Leu Met Thr Ser Gin Phe Ala Asn lie 

1010 1015 1020 

Val Gly Val Cys Val Met Cys He Val Tyr Leu Pro Trp Phe Ala 

1025 1030 1035 

He Ala He Pro Phe Leu Leu Val He Phe Val Leu He Ala Asp 

1040 1045 1050 

His Tyr Gin Ser Ser Gly Arg Glu He Lys Arg Leu Glu Ala Val 

1055 1060 1065 

Gin Arg Ser Phe Val Tyr Asn Asn Leu Asn Glu Val Leu Gly Gly 

1070 1075 1080 

Met Asp Thr He Lys Ala Tyr Arg Ser Gin Glu Arg Phe Leu Ala 

1085 1090 1095 

Lys Ser Asp Phe Leu He Asn Lys Met Asn Glu Ala Gly Tyr Leu 

1100 1105 1110 

Val Val Val Leu Gin Arg Trp Val Gly He Phe Leu Asp Met Val 

1115 1120 1125 

Ala He Ala Phe Ala Leu He He Thr Leu Leu Cys Val Thr Arg 

1130 1135 1140 

Ala Phe Pro He Ser Ala Ala Ser Val Gly Val Leu Leu Thr Tyr 

1145 1150 1155 
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Val Leu Gin Leu Pro Gly Leu Leu Asn Thr lie Leu Arg Ala Met 

1160 1165 1170 

Thr Gin Thr Glu Asn Asp Met Asn Ser Ala Glu Arg Leu Val Thr 

1175 1180 1185 

Tyr Ala Thr Glu Leu Pro Leu Glu Ala Ser Tyr Arg Lys Pro Glu 

1190 1195 1200 

Met Thr Pro Pro Glu Ser Trp Pro Ser Met Gly Glu lie lie Phe 

1205 1210 1215 

Glu Asn Val Asp Phe Ala Tyr Arg Pro Gly Leu Pro lie Val Leu 

1220 1225 1230 

Lys Asn Leu Asn Leu Asn lie Lys Ser Gly Glu Lys lie Gly lie 

1235 1240 1245 

Cys Gly Arg Thr Gly Ala Gly Lys Ser Thr lie Met Ser Ala Leu 

1250 1255 1260 

Tyr Arg Leu Asn Glu Leu Thr Ala Gly Lys lie Leu lie Asp Asn 

1265 1270 1275 

Val Asp lie Ser Gin Leu Gly Leu Phe Asp Leu Arg Arg Lys Leu 

1280 1285 1290 

Ala lie lie Pro Gin Asp Pro Val Leu Phe Arg Gly Thr lie Arg 

1295 1300 1305 

Lys Asn Leu Asp Pro Phe Asn Glu Arg Thr Asp Asp Glu Leu Trp 

1310 1315 1320 

Asp Ala Leu Val Arg Gly Gly Ala lie Ala Lys Asp Asp Leu Pro 
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1325 1330 1335 

Glu Val Lys Leu Gin Lys Pro Asp Glu Asn Gly Thr His Gly Lys 

1340 1345 1350 

Met His Lys Phe His Leu Asp Gin Ala Val Glu Glu Glu Gly Ser 

1355 1360 1365 

Asn Phe Ser Leu Gly Glu Arg Gin Leu Leu Ala Leu Thr Arg Ala 

1370 1375 1380 

Leu Val Arg Gin Ser Lys lie Leu lie Leu Asp Glu Ala Thr Ser 

1385 1390 1395 

Ser Val Asp Tyr Glu Thr Asp Gly Lys lie Gin Thr Arg lie Val 

1400 1405 1410 

Glu Glu Phe Gly Asp Cys Thr lie Leu Cys lie Ala His Arg Leu 

1415 1420 1425 

Lys Thr lie Val Asn Tyr Asp Arg lie Leu Val Leu Glu Lys Gly 

1430 1435 1440 

Glu Val Ala Glu Phe Asp Thr Pro Trp Thr Leu Phe Ser Gin Glu 

1445 1450 1455 

Asp Ser lie Phe Arg Ser Met Cys Ser Arg Ser Gly lie Val Glu 

1460 1465 1470 

Asn Asp Phe Glu Asn Arg Ser 

1475 

(2) INFORMATION FOR SEQ ID NO: 25: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



TTTGGTTAYA TGAAYYTNTT YGGNGT 26 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 



TCTACAAART ARTGGTGNGT NARRTACAT 29 



(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2274 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

TTATATATAT TATTGATTTG TTCCTGTTGT TATTTAGTTT AGAATCAGAC GACTACACCA 60 

GAACCACAAT TCAACCAACA CTTATATAGA ACCTGGCTTG GAAAAAAGTA ACATTTATCA 120 

TTCCTATACT TTTTTAGCAA ACATAATCCG TGTTTTACAT ATATTATTCA CCCAATATCA 180 

TAACAAAAAC AAACTGAATA ATGGCGTCTT CTATTTTGCG TTCCAAAATA ATACAAAAAC 24 0 

CGTACCAATT ATTCCACTAC TATTTTCTTC TGGAGAAGGC TCCTGGTTCT ACAGTTAGTG 300 

ATTTGAATTT TGATACAAAC ATACAAACGA GTTTACGTAA ATTAAAGCAT CATCATTGGA 360 

CGGTGGGAGA AATATTCCAT TATGGGTTTT TGGTTTCCAT ACTTTTTTTC GTGTTTGTGG 4 20 

TTTTCCCAGC TTCATTTTTT ATAAAATTAC CAATAATCTT AGCATTTGCT ACTTGTTTTT 48 0 

TAATACCCTT AACATCACAA TTTTTTCTTC CTGCCTTGCC CGTTTTCACT TGGTTGGCAT 54 0 

TATATTTTAC GTGTGCTAAA ATACCTCAAG AATGGAAACC AGCTATCACA GTTAAAGTTT 600 

TACCAGCTAT GGAAACAATT TTGTACGGCG ATAATTTATC AAATGTTTTG GCAACCATCA 660 

CTACCGGAGT GTTAGATATA TTGGCATGGT TACCATATGG GATTATTCAT TTCAGTTTCC 720 

CATTTGTACT TGCTGCTATT ATATTTTTAT TTGGGCCACC GACGGCATTA AGATCATTTG 780 

GATTTGCCTT TGGTTATATG AACTTGCTTG GAGTCTTGAT TCAAATGGCA TTCCCAGCTG 84 0 

CTCCTCCATG GTACAAAAAC TTGCACGGAT TAGAACCAGC TAATTATTCA ATGCACGGGT 900 

CTCCTGGTGG ACTTGGAAGG ATAGATAAAT TGTTAGGTGT TGATATGTAT ACCACAGGGT 960 
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TTTCCAATTC ATCAATCATT TTTGGGGCAT TCCCATCGTT ACATTCAGGA TGTTGTATCA 1020 
TGGAAGTGTT ATTTTTGTGT TGGTTGTTTC CACGATTCAA GTTTGTGTGG GTTACATACG 1080 
CATCTTGGCT TTGGTGGAGC ACGATGTATT TGACCCATCA CTACTTTGTC GATTTGATTG 114 0 
GTGGAGCCAT GCTATCTTTG ACTGTTTTTG AGTTCACCAA ATATAAATAT TTGCCAAAAA 1200 
ACAAAGAAGG CCTTTTCTGT CGTTGGTCAT ACACTGAAAT TGAAAAAATC GATATCCAAG 1260 
AGATTGACCC TTTATCATAC AATTATATCC CTGTCAACAG CAATGATAAT GAAAGCAGAT 1320 
TGTATACGAG AGTGTACCAA GAGTCTCAGG TTAGTCCCCC ACAGAGAGCT GAAACACCTG 1380 
AAGCATTTGA GATGTCAAAT TTTTCTAGGT CTAGACAAAG CTCAAAGACT CAGGTTCCAT 144 0 
TGAGTAATCT TACTAACAAT GATCAAGTGT CTGGAATTAA CGAAGAGGAT GAAGAAGAAG 1500 
AAGGCGATGA AATTTCATCG AGTACTCCTT CGGTGTTTGA AGACGAACCA CAGGGTAGCA 1560 
CATATGCTGC ATCCTCAGCT ACATCAGTAG ATGATTTGGA TTCCAAAAGA AATTAGTAAA 1620 
ATAACAGTTT CTATTAATTT CTTTATTTCC TCCTAATTAA TGATTTTATG CTCAATACCT 1680 
ACACTATCTG TTTTTAATTT CCTACTTTTT TTTTATTATT GTTGAGTTCA TTTGCTGTTC 174 0 
ATTGAATATT TACAATTTTG CATTAATTAC CATCAATATA GAATGGGCAC AGTTTTTTTA 1800 
AGTTTTTTTG TTTTTGTGTT TGTCTTTCTT TTTTTACATT AATGTGTTTG GATTGTTTTA 18 60 
GGTTCCTTTA TCCCTTAGCC CCCTCAGAAT ACTATTTTAT CTAATTAATT TGTTTTTATT 1920 
TTCTGATATT TACCAATTGC TTTTTCTTTT GGATATTTAT AATAGCATCC CCTAATAATT 1980 
AATATACAAC TGTTTCATAT ATATACGTGT ATGTCCTGTA GTGGTGGAAA CTGGAGTCAA 204 0 
CATTTGTATT AATGTGTACA AGAAAGCAGT GTTAATGCTA CTATTATAAT TTTTGAGGTG 2100 
CAAATCAAGA GGTTGGCAGC TTTCTTATGG CTATGACCGT GAATGAAGGC TTGTAAACCA 2160 
CGTAATAAAC AAAAGCCAAC AAGTTTTTTT AGAGCCTTTA ACAACATACG CAATGAGAGT 2220 
GATTGCAATA CTACAAGATA TAGCCCAAAA AATTGAATGC ATTTCAACAA CAAC 227 4 
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INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



Met Ala Ser Ser lie Leu Arg Ser Lys lie lie Gin Lys Pro Tyr 

5 10 15 

Gin Leu Phe His Tyr Tyr Phe Leu Ser Glu Lys Ala Pro Gly Ser 

20 25 30 

Thr Val Ser Asp Leu Asn Phe Asp Thr Asn lie Gin Thr Ser Leu 

35 40 45 

Arg Lys Leu Lys His His His Trp Thr Val Gly Glu lie Phe His 

50 55 60 

Tyr Gly Phe Leu Val Ser He Leu Phe Phe Val Phe Val Val Phe 

65 70 75 

Pro Ala Ser Phe Phe He Lys Leu Pro He He Leu Ala Phe Ala 

80 . 85 90 

Thr Cys Phe Leu He Pro Leu Thr Ser Gin Phe Phe Leu Pro Ala 

95 100 105 
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Leu Pro Val Phe Thr Trp Leu Ala Leu Tyr Phe Thr Cys Ala Lys 

110 115 120 

He Pro Gin Glu Trp Lys Pro Ala He Thr Val Lys Val Leu Pro 

125 130 135 

Ala Met Glu Thr He Leu Tyr Gly Asp Asn Leu Ser Asn Val Leu 

140 145 150 

Ala Thr He Thr Thr Gly Val Leu Asp He Leu Ala Trp Leu Pro 

155 160 165 

Tyr Gly He He His Phe Ser Phe Pro Phe Val Leu Ala Ala He 

170 175 180 

He Phe Leu Phe Gly Pro Pro Thr Ala Leu Arg Ser Phe Gly Phe 

185 190 195 

Ala Phe Gly Tyr Met Asn Leu Leu Gly Val Leu lie Gin Met Ala 

200 205 210 

Phe Pro Ala Ala Pro Pro Trp Tyr Lys Asn Leu His Gly Leu Glu 

215 220 225 

Pro Ala Asn Tyr Ser Met His Gly Ser Pro Gly Gly Leu Gly Arg 

230 235 240 

He Asp Lys Leu Leu Gly Val Asp Met Tyr Thr Thr Gly Phe Ser 

245 250 255 

Asn Ser Ser He He Phe Gly Ala Phe Pro Ser Leu His Ser Gly 

260 265 270 

Cys Cys He Met Glu Val Leu Phe Leu Cys Trp Leu Phe Pro Arg 
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275 280 285 

Phe Lys Phe Val Trp Val Thr Tyr Ala Ser Trp Leu Trp Trp Ser 

290 295 300 

Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu lie Gly Gly 

305 310 315 

Ala Met Leu Ser Leu Thr Val Phe Glu Phe Thr Lys Tyr Lys Tyr 

320 325 330 

Leu Pro Lys Asn Lys Glu Gly Leu Phe Cys Arg Trp Ser Tyr Thr 

335 340 345 

Glu lie Glu Lys lie Asp lie Gin Glu lie Asp Pro Leu Ser Tyr 

350 355 360 

Asn Tyr lie Pro Val Asn Ser Asn Asp Asn Glu Ser Arg Leu Tyr 

365 370 375 

Thr Arg Val Tyr Gin Glu Ser Gin Val Ser Pro Pro Gin Arg Ala 

380 385 390 

Glu Thr Pro Glu Ala Phe Glu Met Ser Asn Phe Ser Arg Ser Arg 

395 400 405 

Gin Ser Ser Lys Thr Gin Val Pro Leu Ser Asn Leu Thr Asn Asn 

410 415 420 

Asp Gin Val Ser Gly lie Asn Glu Glu Asp Glu Glu Glu Glu Gly 

425 430 435 

Asp Glu lie Ser Ser Ser Thr Pro Ser Val Phe Glu Asp Glu Pro 

440 445 450 
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Gin Gly Ser Thr Tyr Ala Ala Ser Ser Ala Thr Ser Val Asp Asp 

455 460 465 

Leu Asp Ser Lys Arg Asn 

470 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 



TTTGAAAAAT TTGAATTTTA AAATTAATCC 

CGGTGCTGGT AAATCATCAA TTATGACAGC 

GAAAATTATT ATTGATGATA TTGATATTTC 

ATTATCAATT ATTCCTCAAG ATCCAGTATT 
TCC 



AATGGAAAAA ATTGGTATTT GTGGAAGAAC 60 
ATTATATCGA TTATCAGAAT TAGAACTGGG 120 
AACTTTGGGT TTAAAAGATC TTCGATCAAA 180 
ATTCCGAGGT TCAATTCGGA AAAACTTGGA 240 

243 



(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



Leu Lys Asn Leu Asn Phe Lys lie Asn Pro Met Glu Lys lie Gly 

5 10 15 

lie Cys Gly Arg Thr Gly Ala Gly Lys Ser Ser lie Met Thr Ala 

20 25 30 

Leu Tyr Arg Leu Ser Glu Leu Glu Leu Gly Lys lie lie lie Asp 

35 40 45 

Asp lie Asp lie Ser Thr Leu Gly Leu Lys Asp Leu Arg Ser Lys 

50 55 60 

Leu Ser lie lie Pro Gin Asp Pro Val Leu Phe Arg Gly Ser lie 

65 70 75 

Arg Lys Asn Leu Asp 

80 



INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1601 bases 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: genomic DNA 
(iv) ANTI-SENSE: Yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

AGGAAGATGA CTTGCATCAA AGATGGAGGA AGTGGTACTG GCAGGACGAT CAATCAAATC 60 

AGCAGCAGGA CTAGGTAACG GCTCAGGTGA TGATGAACCC ACGGACCATT CATGATCGGT 120 

GTTAGCAAGT TCCATATTGT TAAGACCACT CATGAAGGCT ACTGCATTAG GGTTTTGAGT 180 

AAAAGAATCC CTTCCAAGTA AGTATGGGCT GCCGGTACGA GCCAAGGAGT TGCTGGTTTT 24 0 

TTCGGAAAGA CCATGACCGT GGATAACAAA CTCGTATTCC CAACGAAGGA TTTTACCAGT 300 

TTGCAACTGT GGGAGGCGTA GCTTTTGAGC AAAAACGAAG CATATAATAG CTAAACACAT 360 

ACCGCCGACC AAATCTACAA AGTAGTGGTG GGTAAGGTAC ATAGTACACC AGCAAAGCCA 420 

TAGAACATAT CCATAAAAGC AGAAGCGGTA TCGAGGAAAC ACATGCGAAA GGAAAAGTGC 480 

TTCCAGCATG GCCCATCCAG CGTGAAGAGA TGGAAAGGCA CCAAAAACAA CCGGAGAGTT 54 0 

AGAAAAACCA TCAGTGTAAA TGCTAGTGCC GAAGAGAGCA TCAATACGGG CCAATCCACC 600 

AGGAGAGCCA CGTACTGCAT ACGTGGCAGG TTCTAAACCA TACATATTTT CATACCAAGG 660 

AGGAGAACAG GGGAAAGCCA TTTGGATAAG AACACCAAAT AAATTCATAT AACCAAAAGT 720 

TCGAGCCCAA ACTGGAAGAG TTCCAGGAGG TGCAAAGATG AAAAGAATAA ATGAAATGAT 780 

AAAAGGAGCC GAATAATGCA TGACTCCATA TGGAACCCAG GCCAAAATAT CAAGGATGCT 84 0 

ATGCGTGGTT TTCGAGAGAA GACTAGAAAG ATTAGAGCCA TAAAGAATAT TTTCAAGTGT 900 

GGGTAAAACA CGAACCCATA TGGGTGGACG CCAGCGTTCT GGAATAAACC TACAAGAGTA 960 

AAATAAAATT GCCCAGGTGA TGATAACAAT GGCAGGAAAA AAAATTTGGC GTGTTAAAGG 102 0 
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AACGGTCAAC GCAATGGCCA AAAGACAGGC AATGCCAAAT TTCCCCCAGA ATCCAGGAGA 1080 

TTCAATGACA ATACAAGCAA AAATCAAATT ACCTGCTAGA AACACATATT GCAAATGTGT 1140 

CCATGACCAT TTCGTATTGC GTAGCAAACG AAATGTAGGC ATAGGGTTTA AGCTTGTTTC 1200 

CAACTTGTAT TGGGATGCTC GGTTACACGC AGCAAGGCGC TTTTTTAAGG TCGAAAGAGC 12 60 

AGACATTGCT TCAAAGAATT ATCAGAGTAA AAAAGGGAAG CGTACGAAAA AAATTTCGTA 1320 

AGGAATTAAC CGGAAAACTA AAGGAAAAAA AAGGAATTTT TATGAAGGAA AGAAAGTAGC 1380 

TATTAAATGC AAGTGTCAAG CACTTAAAAG TAGCGATGTA AAATATTTAA AAAAAGATGG 14 40 

ACCGATTAAC CAATGTTCAG CTCACAGTTG CCAGCAATCA GGGCTATTTT TTTATTTTTT 1500 

TTATAAAATT GCTAATTATA TATAATATAA TTAGTTTATT AACTTGCTTT TCCTCAAAAA 1560 

ACCAATTCGA GAAAGGAACT TTTGCAGAGG CAAAAAAGCT T 1601 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1601 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA 
(iv) ANTI-SENSE: Yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



AGGAAGAUGA CUUGCAUCAA AGAUGGAGGA AGUGGUACUG GCAGGACGAU CAAUCAAAUC 60 
AGCAGCAGGA CUAGGUAACG GCUCAGGUGA UGAUGAACCC ACGGACCAUU CAUGAUCGGU 120 
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GUUAGCAAGU UCCAUAUUGU UAAGACCACU CAUGAAGGCU ACUGCAUUAG GGUUUUGAGU 180 

AAAAGAAUCC CUUCCAAGUA AGUAUGGGCU GCCGGUACGA GCCAAGGAGU UGCUGGUUUU 24 0 

UUCGGAAAGA CCAUGACCGU GGAUAACAAA CUCGUAUUCC CAACGAAGGA UUUUACCAGU 300 

UUGCAACUGU GGGAGGCGUA GCUUUUGAGC AAAAACGAAG CAUAUAAUAG CUAAACACAU 360 

ACCGCCGACC AAAUCUACAA AGUAGUGGUG GGUAAGGUAC AUAGUACACC AGCAAAGCCA 420 

UAGAACAUAU CCAUAAAAGC AGAAGCGGUA UCGAGGAAAC ACAUGCGAAA GGAAAAGUGC 4 80 

UUCCAGCAUG GCCCAUCCAG CGUGAAGAGA UGGAAAGGCA CCAAAAACAA CCGGAGAGUU 54 0 

AGAAAAACCA UCAGUGUAAA UGCUAGUGCC GAAGAGAGCA UCAAUACGGG CCAAUCCACC 600 

AGGAGAGCCA CGUACUGCAU ACGUGGCAGG UUCUAAACCA UACAUAUUUU CAUACCAAGG 660 

AGGAGAACAG GGGAAAGCCA UUUGGAUAAG AACACCAAAU AAAUUCAUAU AACCAAAAGU 720 

UCGAGCCCAA ACUGGAAGAG UUCCAGGAGG UGCAAAGAUG AAAAGAAUAA AUGAAAUGAU 780 

AAAAGGAGCC GAAUAAUGCA UGACUCCAUA UGGAACCCAG GCCAAAAUAU CAAGGAUGCU 84 0 

AUGCGUGGUU UUCGAGAGAA GACUAGAAAG AUUAGAGCCA UAAAGAAUAU UUUCAAGUGU 900 

GGGUAAAACA CGAACCCAUA UGGGUGGACG CCAGCGUUCU GGAAUAAACC UACAAGAGUA 960 

AAAUAAAAUU GCCCAGGUGA UGAUAACAAU GGCAGGAAAA AAAAUUUGGC GUGUUAAAGG 1020 

AACGGUCAAC GCAAUGGCCA AAAGACAGGC AAUGCCAAAU UUCCCCCAGA AUCCAGGAGA 1080 

UUCAAUGACA AUACAAGCAA AAAUCAAAUU ACCUGCUAGA AACACAUAUU GCAAAUGUGU 114 0 

CCAUGACCAU UUCGUAUUGC GUAGCAAACG AAAUGUAGGC AUAGGGUUUA AGCUUGUUUC 1200 

CAACUUGUAU UGGGAUGCUC GGUUACACGC AGCAAGGCGC UUUUUUAAGG UCGAAAGAGC 1260 

AGACAUUGCU UCAAAGAAUU AUCAGAGUAA AAAAGGGAAG CGUACGAAAA AAAUUUCGUA 1320 

AGGAAUUAAC CGGAAAACUA AAGGAAAAAA AAGGAAUUUU UAUGAAGGAA AGAAAGUAGC 1380 

UAUUAAAUGC AAGUGUCAAG CACUUAAAAG UAGCGAUGUA AAAUAUUUAA AAAAAGAUGG 1440 

ACCGAUUAAC CAAUGUUCAG CUCACAGUUG CCAGCAAUCA GGGCUAUUUU UUUAUUUUUU 1500 
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UUAUAAAAUU GCUAAUUAUA UAUAAUAUAA UUAGUUUAUU AACUUGCUUU UCCUCAAAAA 1560 
ACCAAUUCGA GAAAGGAACU UUUGCAGAGG CAAAAAAGCU U 1601 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Cys Phe Thr Ser Ser Tyr Phe Pro Asp Asp Arg Arg 

5 10 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide • 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
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9 1 

Cys Tyr Thr Ser lie Glu Lys Tyr Asp lie Ser Lys Ser Asp Pro 

5 10 15 

Leu Ala Ala Asp 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1553 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

TTTTACATAT ATTATTCACC CAATATCATA ACAAAAACAA ACTGAATGAT GGCATCTTCT 60 

ATTTTGCGTT CCAAAATAAT ACAAAAACCG TACCAATTAT TCCACTACTA TTTTCTTCTG 120 

GAGAAGGCTC CTGGTTCTAC AGTTAGTGAT TTGAATTTTG ATACAAACAT ACAAACGAGT 180 

TTACGTAAAT TAAAGCATCA TCATTGGACG GTGGGAGAAA TATTCCATTA TGGGTTTTTG 240 

GTTTCCATAC TTTTTTTCGT GTTTGTGGTT TTCCCAGCTT CATTTTTTAT AAAATTACCA 300 

ATAATCTTAG CATTTGCTAC TTGTTTTTTA ATACCCTTAA CATCACAATT TTTTCTTCCT 360 

GCCTTGCCCG TTTTCACTTG GTTGGCATTA TATTTTACGT GTGCTAAAAT ACCTCAAGAA 420 

TGGAAACCAG CTATCACAGT TAAAGTTTTA CCAGCTATGG AAACAATTTT GTACGGCGAT 4 80 

AATTTATCAA ATGTTTTGGC AACCATCACT ACCGGAGTGT TAGATATATT GGCATGGTTA 54 0 

CCATATGGGA TTATTCATTT CAGTTTCCCA TTTGTACTTG CTGCTATTAT ATTTTTATTT 600 
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GGGCCACCGA CGGCATTAAG ATCATTTGGA TTTGCCTTTG GTTATATGAA CTTGCTTGGA 660 
GTCTTGATTC AAATGGCATT CCCAGCTGCT CCTCCATGGT ACAAAAACTT GCACGGATTA 720 
GAACCAGCTA ATTATTCAAT GCACGGGTCT CCTGGTGGAC TTGGAAGGAT AGATAAATTG 78 0 
TTAGGTGTTG ATATGTATAC CACAGGGTTT TCCAATTCAT CAATCATTTT TGGGGCATTC 84 0 
CCATCGTTAC ATTCAGGATG TTGTATCATG GAAGTGTTAT TTTTGTGTTG GTTGTTTCCA 900 
CGATTCAAGT TTGTGTGGGT TACATACGCA TCTTGGCTTT GGTGGAGCAC GATGTATTTG 960 
ACCCATCACT ACTTTGTCGA TTTGATTGGT GGAGCCATGC TATCTTTGAC TGTTTTTGAA 1020 
TTCACCAAAT ATAAATATTT GCCAAAAAAC AAAGAAGGCC TTTTCTGTCG TTGGTCATAC 108 0 
ACTGAAATTG AAAAAATCGA TATCCAAGAG ATTGACCCTT TATCATACAA TTATATCCCT 114 0 
GTCAACAGCA ATGATAATGA AAGCAGATTG TATACGAGAG TGTACCAAGA GCCTCAGGTT 1200 
AGTCCCCCAC AGAGAGCTGA AACACCTGAA GCATTTGAGA TGTCAAATTT TTCTAGGTCT 1260 
AGACAAAGCT CAAAGACTCA GGTTCCATTG AGTAATCTTA CTAACAATGA TCAAGTGCCT 1320 
GGAATTAACG AAGAGGATGA AGAAGAAGAA GGCGATGAAA TTTCGTCGAG TACTCCTTCG 1380 
GTGTTTGAAG ACGAACCACA GGGTAGCACA TATGCTGCAT CCTCAGCTAC ATCAGTAGAT 144 0 
GATTTGGATT CCAAAAGAAA TTAGTAAAAC AGCAGTTTCT ATTAATTTCT TTATTTCCTC 1500 
CTAATTAATG ATTTTATGTT CAATACCTAC ACTATCTGTT TTTAATTTCC TAC 1553 



(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Met Ala Ser Ser lie Leu Arg Ser Lys lie lie Gin Lys Pro 

1 "5 10 15 

Tyr Gin Leu Phe His Tyr Tyr Phe Leu Leu Glu Lys Ala Pro Gly 

20 25 30 

Ser Thr Val Ser Asp Leu Asn Phe Asp Thr Asn lie Gin Thr Ser 

35 40 45 

Leu Arg Lys Leu Lys His His His Trp Thr Val Gly Glu lie Phe 

50 55 60 

His Tyr Gly Phe Leu Val Ser He Leu Phe Phe Val Phe Val Val 

65 70 75 

Phe Pro Ala Ser Phe Phe He Lys Leu Pro He He Leu Ala Phe 

80 85 90 

Ala Thr Cys Phe Leu He Pro Leu Thr Ser Gin Phe Phe Leu Pro 

95 100 105 

Ala Leu Pro Val Phe Thr Trp Leu Ala Leu Tyr Phe Thr Cys Ala 

110 115 120 

Lys He Pro Gin Glu Trp Lys Pro Ala He Thr Val Lys Val Leu 

125 130 135 

Pro Ala Met Glu Thr He Leu Tyr Gly Asp Asn Leu Ser Asn Val 

140 145 150 
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Leu Ala Thr lie Thr Thr Gly Val Leu Asp lie Leu Ala Trp Leu 

155 160 165 

Pro Tyr Gly lie lie His Phe Ser Phe Pro Phe Val Leu Ala Ala 

170 175 180 

He He Phe Leu Phe Gly Pro Pro Thr Ala Leu Arg Ser Phe Gly 

185 190 195 

Phe Ala Phe Gly Tyr Met Asn Leu Leu Gly Val Leu He Gin Met 

200 205 210 

Ala Phe Pro Ala Ala Pro Pro Trp Tyr Lys Asn Leu His Gly Leu 

215 220 225 

Glu Pro Ala Asn Tyr Ser Met His Gly Ser Pro Gly Gly Leu Gly 

230 235 240 

Arg He Asp Lys Leu Leu Gly Val Asp Met Tyr Thr Thr Gly Phe 

245 250 255 

Ser Asn Ser Ser He lie Phe Gly Ala Phe Pro Ser Leu His Ser 

260 265 270 

Gly Cys Cys He Met Glu Val Leu Phe Leu Cys Trp Leu Phe Pro 

275 280 285 

Arg Phe Lys Phe Val Trp Val Thr Tyr Ala Ser Trp Leu Trp Trp 

290 295 300 

Ser Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu He Gly 

305 310 315 

Gly Ala Met Leu Ser Leu Thr Val Phe Glu Phe Thr Lys Tyr Lys 
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320 325 330 

Tyr Leu Pro Lys Asn Lys Glu Gly Leu Phe Cys Arg Trp Ser Tyr 

335 340 345 

Thr Glu lie Glu Lys lie Asp lie Gin Glu lie Asp Pro Leu Ser 

350 355 360 

Tyr Asn Tyr lie Pro Val Asn Ser Asn Asp Asn Glu Ser Arg Leu 

365 370 375 

Tyr Thr Arg Val Tyr Gin Glu Pro Gin Val Ser Pro Pro Gin Arg 

380 385 390 

Ala Glu Thr Pro Glu Ala Phe Glu Met Ser Asn Phe Ser Arg Ser 

395 400 405 

Arg Gin Ser Ser Lys Thr Gin Val Pro Leu Ser Asn Leu Thr Asn 

410 415 420 

Asn Asp Gin Val Pro Gly lie Asn Glu Glu Asp Glu Glu Glu Glu 

425 430 435 

Gly Asp Glu lie Ser Ser Ser Thr Pro Ser Val Phe Glu Asp Glu 

440 445 450 

Pro Gin Gly Ser Thr Tyr Ala Ala Ser Ser Ala Thr Ser Val Asp 

455 460 465 
Asp Leu Asp Ser Lys Arg Asn 

470 



INFORMATION FOR SEQ ID NO: 37: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

GACTATTTCA TTATGGGGCC CC 22 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

GTTAACTCGA GAAAGTGCCC ATCAGTGTTC 30 

(2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 29 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

GTTAACGGTA CCTCATCGTT ACACCGTTC 29 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

GCTAAACGAC AATCCTGAC 19 

(2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CGTTGGCCGA TTCATTAATG C 21 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Ala Asn Pro Phe Ser Arg Trp Phe Leu Ser Glu Arg Pro Pro 
15 10 15 

Asn Cys His Val Ala Asp Leu Glu Thr Ser Leu Asp Pro His Gin 

20 25 30 

Thr Leu Leu Lys Val Gin Lys Tyr Lys Pro Ala Leu Ser Asp Trp 

35 40 45 
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Val His Tyr 



Thr Asn Pro 



Leu Gly Thr 



Asn Ala Leu 



Ser Tyr Phe 



Leu Pro Ala 



lie Leu Ala 



Leu Pro Tyr 



Ala lie Leu 



Ala Phe Ala 



Asn Val Phe 



Leu Gin Ser 



lie Phe Leu 
50 

Ala Pro Trp 
65 

Leu Phe lie 
80 

Pro lie Leu 
95 

Pro Asp Asp 
110 

Val Glu Thr 
125 

Thr Ser Thr 
140 

Gly Leu Phe 
155 

Phe Val Phe 
170 

Phe Gly Tyr 
185 

Pro Ala Ala 
200 

Ala Asn Tyr 



Gly Ser lie 



lie Phe Lys 



lie Pro Ala 



Thr Trp Val 



Arg Arg Pro 



lie Leu Tyr 



Asn Ser Phe 



His Phe Gly 



Gly Pro Pro 



Met Asn Leu 



Pro Pro Trp 



Asp Met His 



Met Leu Phe 
55 

lie Leu Phe 
70 

Thr Ser Gin 
85 

Ala Leu Tyr 
100 

Pro lie Thr 
115 

Gly Asp Asn 
130 

Leu Asp lie 
145 

Ala Pro Phe 
160 

Thr Val Leu 
175 

Phe Gly Val 
190 

Tyr Lys lie 
205 

Gly Ser Pro 



Val Phe He 
60 

Tyr Cys Phe 
75 

Phe Phe Phe 
90 

Phe Thr Ser 
105 

Val Lys Val 
120 

Leu Ser Asp 
135 

Leu Ala Trp 
150 

Val Val Ala 
165 

Gin Gly Tyr 
180 

He Met Gin 
195 

Leu Tyr Gly 
210 

Gly Gly Leu 
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215 220 225 

Ala Arg lie Asp Lys Leu Leu Gly lie Asn Met Tyr Thr Thr Cys 

230 235 240 

Phe Ser Asn Ser Ser Val lie Phe Gly Ala Phe Pro Ser Leu His 

245 250 255 

Ser Gly Cys Ala Thr Met Glu Ala Leu Phe Phe Cys Tyr Cys Phe 

260 265 270 

Pro Lys Leu Lys Pro Leu Phe lie Ala Tyr Val Cys Trp Leu Trp 

275 280 285 

Trp Ser Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu Met 

290 295 300 

Ala Gly Ser Val Leu Ser Tyr Val lie Phe Gin Tyr Thr Lys Tyr 

305 310 315 

Thr His Leu Pro lie Val Asp Thr Ser Leu Phe Cys Arg Trp Ser 

320 325 330 

Tyr Thr Ser lie Glu Lys Tyr Asp lie Ser Lys Ser Asp Pro Leu 

335 340 345 

Ala Ala Asp Ser Asn Asp lie Glu Ser Val Pro Leu Ser Asn Leu 

350 355 360 

Glu Leu Asp Phe Asp Leu Asn Met Thr Asp Glu' Pro Ser Val Ser 

365 370 375 

Pro Ser Leu Phe Asp Gly Ser Thr Ser Val Ser Arg Ser Ser Ala 

380 385 390 
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Thr Ser lie Thr Ser Leu Gly Val Lys Arg Ala 

395 400 



INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Met Ala Asn Pro Phe Ser Arg Trp Phe Leu Ser Glu Arg Pro Pro 
15 10 15 

Asn Cys His Val Ala Asp Leu Glu Thr Ser Leu Asp Pro His Gin 

20 25 30 

Thr Leu Leu Lys Val Gin Lys Tyr Lys Pro Ala Leu Ser Asp Trp 

35 40 45 

Val His Tyr He Phe Leu Gly Ser He Met Leu Phe Val Phe He 

50 55 60 

Thr Asn Pro Ala Pro Trp He Phe Lys He Leu Phe Tyr Cys Phe 

65 70 75 

Leu Gly Thr Leu Phe He He Pro Ala Thr Ser Gin Phe Phe Phe 
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80 85 90 

Asn Ala Leu Pro lie Leu Thr Trp Val Ala Leu Tyr Phe Thr Ser 

95 100 105 

Ser Tyr Phe Pro Asp Asp Arg Arg Pro Pro lie Thr Val Lys Val 

110 115 120 

Leu Pro Ala Val Glu Thr lie Leu Tyr Gly Asp Asn Leu Ser Asp 

125 130 135 

lie Leu Ala Thr Ser Thr Asn Ser Phe Leu Asp lie Leu Ala Trp 

140 145 150 

Leu Pro Tyr Gly Leu Phe His Tyr Gly Ala Pro Phe Val Val Ala 

155 160 165 

Ala He Leu Phe Val Phe Gly Pro Pro Thr Val Leu Gin Gly Tyr 

170 175 180 

Ala Phe Ala Phe Gly Tyr Met Asn Leu Phe Gly Val He Met Gin 

185 190 195 

Asn Val Phe Pro Ala Ala Pro Pro Trp Tyr Lys He Leu Tyr Gly 

200 205 210 

Leu Gin Ser Ala Asn Tyr Asp Met His Gly Ser Pro Gly Gly Leu 

215 220 225 

Ala Arg He Asp Lys Leu Leu Gly He Asn Met Tyr Thr Thr Cys 

230 235 240 

Phe Ser Asn Ser Ser Val He Phe Gly Ala Phe Pro Ser Leu His 

245 250 255 
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Ser Gly Cys Ala Thr Met Glu Ala Leu Phe Phe Cys Tyr Cys Phe 

260 265 270 

Pro Lys Leu Lys Pro Leu Phe lie Ala Tyr Val Cys Trp Leu Trp 

275 280 285 

Trp Ser Thr Met Tyr Leu Thr His His Tyr Phe Val Asp Leu Met 

290 295 300 

Ala Gly Ser Val Leu Ser Tyr Val lie Phe Gin Tyr Thr Lys Tyr 

305 310 315 

Thr His Leu Pro lie Val Asp Thr Ser Leu Phe Cys Arg Trp Ser 

320 325 330 

Tyr Thr Ser lie Glu Lys Tyr Asp lie Ser Lys Ser Asp Pro Leu 

335 340 345 

Ala Ala Asp Ser Asn Asp lie Glu Ser Val Pro Leu Ser Asn Leu 

350 355 360 

Glu Leu Asp Phe Asp Leu Asn Met Thr Asp Glu Pro Ser Val Ser 

365 370 375 

Pro Ser Leu Phe Asp Gly Ser Thr Ser Val Ser Arg Ser Ser Ala 

380 385 390 

Thr Ser lie Thr Ser Leu Gly Val Lys Arg Ala 

395 400 

(2) INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

ATGGCAAACC CTTTTTCGAG ATGGTTTCTA TCAGAGAGAC CTCCAAACTG CCATGTAGCC 60 

GATTTAGAAA CAAGTTTAGA TCCCCATCAA ACGTTGTTGA AGGTGCAAAA ATACAAACCC 120 

GCTTTAAGCG ACTGGGTGCA TTACATCTTC TTGGGATCCA TCATGCTGTT TGTGTTCATT 180 

ACTAATCCCG CACCTTGGAT CTTCAAGATC CTTTTTTATT GTTTCTTGGG CACTTTATTC 24 0 

ATCATTCCAG CTACGTCACA GTTTTTCTTC AATGCCTTGC CCATCCTAAC ATGGGTGGCG 300 

CTGTATTTCA CTTCATCGTA CTTTCCAGAT GACCGCAGGC CTCCTATTAC TGTCAAAGTG 360 

TTACCAGCGG TGGAAACAAT TTTATACGGC GACAATTTAA GTGATATTCT TGCAACATCG 420 

ACGAATTCCT TTTTGGACAT TTTAGCATGG TTACCGTACG GACTATTTCA TTTTGGGGCC 480 

CCATTTGTCG TTGCTGCCAT CTTATTCGTA TTTGGTCCAC CAACTGTTTT GCAAGGTTAT 54 0 

GCTTTTGCAT TTGGTTATAT GAACCTGTTT GGTGTTATCA TGCAAAATGT CTTTCCAGCC 600 

GCTCCCCCAT GGTATAAAAT TCTCTATGGA TTGCAATCAG CCAACTATGA TATGCATGGC 660 

TCGCCTGGTG GATTAGCTAG AATTGATAAG CTACTCGGTA TTAATATGTA TACTACATGT 720 

TTTTCAAATT CCTCCGTCAT TTTCGGTGCT TTTCCTTCAC TGCATTCCGG GTGTGCTACT 780 

ATGGAAGCCC TGTTTTTCTG TTATTGTTTT CCAAAATTGA AGCCCTTGTT TATTGCTTAT 84 0 

GTTTGCTGGT TATGGTGGTC AACTATGTAT CTGACACACC ATTATTTTGT AGACCTTATG 900 

GCAGGTTCTG TGCTGTCATA CGTTATTTTC CAGTACACAA AGTACACACA TTTACCAATT 960 
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GTAGATACAT CTCTTTTTTG CAGATGGTCA 
AAGAGTGATC CATTGGCTGC AGATTCAAAC 
GAACTTGACT TTGATCTTAA TATGACTGAT 
GGATCTACTT CTGTTTCTCG TTCGTCCGCC 
GCTTAA 



TACACTTCAA TTGAGAAATA CGATATATCA 1020 
GATATCGAAA GTGTCCCTTT GTCCAACTTG 1080 
GAACCCAGTG TAAGCCCTTC GTTATTTGAT 114 0 
ACGTCTATAA CGTCACTAGG TGTAAAGAGG 1200 

1206 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

ATGGCAAACC CTTTTTCGAG ATGGTTTCTA TC AG AG AG AC CTCCAAACTG CCATGTAGCC 60 

GATTTAGAAA CAAGTTTAGA TCCCCATCAA ACGTTGTTGA AGGTGCAAAA ATACAAACCC 120 

GCTTTAAGCG ACTGGGTGCA TTACATCTTC TTGGGATCCA TCATGCTGTT TGTGTTCATT 18 0 

ACTAATCCCG CACCTTGGAT CTTCAAGATC CTTTTTTATT GTTTCTTGGG CACTTTATTC 24 0 

ATCATTCCAG CTACGTCACA GTTTTTCTTC AATGCCTTGC CCATCCTAAC ATGGGTGGCG 300 

CTGTATTTCA CTTCATCGTA CTTTCCAGAT GACCGCAGGC CTCCTATTAC TGTCAAAGTG 360 

TTACCAGCGG TGGAAACAAT TTTATACGGC GACAATTTAA GTGATATTCT TGCAACATCG 420 

ACGAATTCCT TTTTGGACAT TTTAGCATGG TTACCGTACG GACTATTTCA TTATGGGGCC 480 
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CCATTTGTCG TTGCTGCCAT CTTATTCGTA 
GCTTTTGCAT TTGGTTATAT GAACCTGTTT 
GCTCCCCCAT GGTATAAAAT TCTCTATGGA 
TCGCCTGGTG GATTAGCTAG AATTGATAAG 
TTTTCAAATT CCTCCGTCAT TTTCGGTGCT 
ATGGAAGCCC TGTTTTTCTG TTATTGTTTT 
GTTTGCTGGT TATGGTGGTC AACTATGTAT 
GCAGGTTCTG TGCTGTCATA CGTTATTTTC 
GTAGATACAT CTCTTTTTTG CAGATGGTCA 
AAGAGTGATC CATTGGCTGC AGATTCAAAC 
GAACTTGACT TTGATCTTAA TATGACTGAT 
GGATCTACTT CTGTTTCTCG TTCGTCCGCC 
GCTTAA 



TTTGGTCCAC CAACTGTTTT GCAAGGTTAT 54 0 
GGTGTTATCA TGCAAAATGT CTTTCCAGCC 600 
TTGCAATCAG CCAACTATGA TATGCATGGC 660 
CTACTCGGTA TTAATATGTA TACTACATGT 720 
TTTCCTTCAC TGCATTCCGG GTGTGCTACT 780 
CCAAAATTGA AGCCCTTGTT TATTGCTTAT 84 0 
CTGACACACC ATTATTTTGT AGACCTTATG 900 
CAGTACACAA AGTACACACA TTTACCAATT 960 
TACACTTCAA TTGAGAAATA CGATATATCA 1020 
GATATCGAAA GTGTCCCTTT GTCCAACTTG 1080 
GAACCCAGTG TAAGCCCTTC GTTATTTGAT 1140 
ACGTCTATAA CGTCACTAGG TGTAAAGAGG 1200 

1206 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
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ATGGCAAACC CTTTTTCGAG ATGGTTTCTA TCAGAGAGAC 
GATTTAGAAA CAAGTTTAGA TCCCCATCAA ACGTTGTTGA 
GCTTTAAGCG ACTGGGTGCA TTACATCTTC TTGGGATCCA 
ACTAATCCCG CACCTTGGAT CTTCAAGATC CTTTTTTATT 
ATCATTCCAG CTACGTCACA GTTTTTCTTC AATGCCTTGC 
CTGTATTTCA CTTCATCGTA CTTTCCAGAT GACCGCAGGC 
TTACCAGCGG TGGAAACAAT TTTATACGGC GACAATTTAA 
ACGAATTCCT TTTTGGACAT TTTAGCATGG TTACCGTACG 
CCATTTGTCG TTGCTGCCAT CTTATTCGTA TTTGGTCCAC 
GCTTTTGCAT TTGGTTATAT GAACCTGTTT GGTGTTATCA 
GCTCCCCCAT GGTATAAAAT TCTCTATGGA TTGCAATCAG 
TCGCCTGGTG GATTAGCTAG AATTGATAAG CTACTCGGTA 
TTTTCAAATT CCTCCGTCAT TTTCGGTGCT TTTCCTTCAC 
ATGGAAGCCC TGTTTTTCTG TTATTGTTTT CCAAAATTGA 
GTTTGCTGGT TATGGTGGTC AACTATGTAT CTGACACACC 
GCAGGTTCTG TGCTGTCATA CGTTATTTTC CAGTACACAA 
GTAGATACAT CTCTTTTTTG CAGATGGTCA TACACTTCAA 
AAGAGTGATC CATTGGCTGC AGATTCAAAC GATATCGAAA 
GAACTTGACT TTGATCTTAA TATGACTGAT GAACCCAGTG 
GGATCTACTT CTGTTTCTCG TTCGTCCGCC ACGTCTATAA 
GCTTAA 

(2) INFORMATION FOR SEQ ID NO: 47: 



CTCCAAACTG CCATGTAGCC 60 

AGGTGCAAAA ATACAAACCC 120 

TCATGCTGTT TGTGTTCATT 180 

GTTTCTTGGG CACTTTATTC 240 

CCATCCTAAC ATGGGTGGCG 300 

CTCCTATTAC TGTCAAAGTG 360 

GTGATATTCT TGCAACATCG 420 

GACTATTTCA TTATGGGGCC 480 

CAACTGTTTT GCAAGGTTAT 54 0 

TGCAAAATGT CTTTCCAGCC 600 

CCAACTATGA TATGCATGGC 660 

TTAATATGTA TACTACAGCT 720 

TGCATTCCGG GTGTGCTACT 780 

AGCCCTTGTT TATTGCTTAT 840 

ATTATTTTGT AGACCTTATG 900 

AGTACACACA TTTACCAATT 960 

TTGAGAAATA CGATATATCA 1020 

GTGTCCCTTT GTCCAACTTG 1080 

TAAGCCCTTC GTTATTTGAT 114 0 

CGTCACTAGG TGTAAAGAGG 1200 

1206 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

ATGGCAAACC CTTTTTCGAG ATGGTTTCTA TCAGAGAGAC CTCCAAACTG CCATGTAGCC 60 

GATTTAGAAA CAAGTTTAGA TCCCCATCAA ACGTTGTTGA AGGTGCAAAA ATACAAACCC 120 

GCTTTAAGCG ACTGGGTGCA TTACATCTTC TTGGGATCCA TCATGCTGTT TGTGTTCATT 180 

ACTAATCCCG CACCTTGGAT CTTCAAGATC CTTTTTTATT GTTTCTTGGG CACTTTATTC 240 

ATCATTCCAG CTACGTCACA GTTTTTCTTC AATGCCTTGC CCATCCTAAC ATGGGTGGCG 300 

CTGTATTTCA CTTCATCGTA CTTTCCAGAT GACCGCAGGC CTCCTATTAC TGTCAAAGTG 360 

TTACCAGCGG TGGAAACAAT TTTATACGGC GACAATTTAA GTGATATTCT TGCAACATCG 420 

ACGAATTCCT TTTTGGACAT TTTAGCATGG TTACCGTACG GACTATTTCA TTTTGGGGCC 4 80 

CCATTTGTCG TTGCTGCCAT CTTATTCGTA TTTGGTCCAC CAACTGTTTT GCAAGGTTAT 540 

GCTTTTGCAT TTGGTTATAT GAACCTGTTT GGTGTTATCA TGCAAAATGT CTTTCCAGCC 600 

GCTCCCCCAT GGTATAAAAT TCTCTATGGA TTGCAATCAG CCAACTATGA TATGCATGGC 660 

TCGCCTGGTG GATTAGCTAG AATTGATAAG CTACTCGGTA TTAATATGTA TACTACAGCT 720 

TTTTCAAATT CCTCCGTCAT TTTCGGTGCT TTTCCTTCAC TGCATTCCGG GTGTGCTACT 7 80 

ATGGAAGCCC TGTTTTTCTG TTATTGTTTT CCAAAATTGA AGCCCTTGTT TATTGCTTAT 84 0 

GTTTGCTGGT TATGGTGGTC AACTATGTAT CTGACACACC ATTATTTTGT AGACCTTATG 900 
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I 



GCAGGTTCTG TGCTGTCATA CGTTATTTTC 
GTAGATACAT CTCTTTTTTG CAGATGGTCA 
AAGAGTGATC CATTGGCTGC AGATTCAAAC 
GAACTTGACT TTGATCTTAA TATGACTGAT 
GGATCTACTT CTGTTTCTCG TTCGTCCGCC 
GCTTAA 



CAGTACACAA AGTACACACA TTTACCAATT 960 
TACACTTCAA TTGAGAAATA CGATATATCA 1020 
GATATCGAAA GTGTCCCTTT GTCCAACTTG 1080 
GAACCCAGTG TAAGCCCTTC GTTATTTGAT 1140 
ACGTCTATAA CGTCACTAGG TGTAAAGAGG 1200 

1206 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



AATATGTATA CTACATGTTT TTCAAATTCC 30 



(2) INFORMATION FOR SEQ ID NO: 49: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

GTTAACTCGA GAAAGTGCCC ATCAGTGTTC 30 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



GTTAACGGTA CCAGAGGAAA GAATAACGC 29 



